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Chapter 1 
Introduction 



The objects, laws, and phenomena of Nature have been the subject of physics 
for hundreds of years In the second half of the 20th century, however, 
new interdisciplinary and applied branches of physics were developed that 
merged a wide range of scientific disciplines with physics including economy, 
biology, chemistry and geology. Most of the new branches of physics could 
not have evolved as they did without a specific new technological invention, 
namely computer technology, which developed independent from and paral- 
lel to physics. With the help of computers new research methods became 
available, e.g., time-series analysis, computer simulation, and data mining. 

As the use of computers was spreading across the globe, computers them- 
selves not only became increasingly useful tools for the research community, 
but their evolving network attracted growing academic interest. As a research 
tool the computer become the subject of research itself. In a pioneering work 
by Csabai in 1994 [2] the traffic fluctuations of the then Internet as it existed 
at the time was investigated. The author found that the power spectrum of 
the traffic delays is 1 / /-like, similarly to other collective phenomena, e.g., 
highway traffic. Nowadays, a new interdisciplinary science is forming to ex- 
plore and model complex networks [§], in particular the Internet. 

The Internet is an exceptional example of complex networks in a number 
of aspects. Firstly, the structure of complex networks is often the subject 
of research. The Internet's infrastructure makes it possible to carry out 
measurements on the network cheaply and easily on an incomparable scale. 
Secondly, data traffic runs in the network, which adds another level of com- 
plexity to the system. Thirdly, the Internet is a human engineered physical 
network, which matches the complexity of some biological systems. 
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A useful mathematical abstraction of a network is a graph, because the 
number of elements of real networks is finite. However, the number of el- 
ements of complex networks is too large for the individual consideration of 
each network element. Moreover, the exact principles governing connections 
between different network components are usually unknown. Therefore, one 
should rely on statistical methods, specifically the tools of statistical physics, 
in order to describe the structure of complex networks. 

The optimization of traffic performance has great practical importance. 
The data flows can be regarded as interacting dynamical systems superposed 
onto the network infrastructure. The theory of dynamical systems can there- 
fore prove to be a useful tool for studying network traffic. 

All in all the Internet is an interesting new area of academic research and 
several well established tools of physics can be quite useful for studying it. 
Since the Internet has many layers, a number of different components and, 
moreover, it is in constant development, it would be an impossible task to 
cover all aspects of its operation. Instead, I will concentrate on the dynamical 
modeling of the Transmission Control Protocol (TCP), the most important 
traffic regulatory algorithm of the current Internet. After the introductory 
chapter where the most important concepts of the Internet are introduced I 
begin my survey with the investigation of TCP operating on an elementary 
network configuration: a single buffer serving a link. This scenario comprises 
the building blocks of Internet traffic. I proceed further with the refinement 
of the first model, and I consider the finite storage capacity of routers in 
the next chapter. After the analytic and simulation study of the previous 
elementary single buffer models a more complex model of the Internet follows. 
Specifically, in the last chapter I examine the problem of efficient capacity 
distribution in a growing tree-like network. 

1.1 The Internet 

1.1.1 The short history of the Internet 

The price and sheer size of the first computers restricted their applicability 
in the military and academic sphere. Motivated by the military needs of 
the United States in the cold war era a novel concept, the theory of packet- 
switching, was proposed by the Advanced Research Projects Agency (ARPA) 
to connect distant computers in a decentralized manner. The concept of 
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packet-switching means that, contrary to connection-based circuit-switching, 
resources are not reserved for communication between host and destination, 
but data is split into small datagrams which are transmitted through the 
network individually. The first physical network was constructed in 1969 be- 
tween four US Universities: the University of California Los Angeles, Stan- 
ford Research Institute, University of Utah and University of California Santa 
Barbara. This small network, called ArpaNet, is commonly perceived as 
the origin of the current Internet. Over the course of the following years the 
network grew gradually and connected more and more universities. By 1981 
the number of hosts had grown to more than 200. 

Based on ARPA's research, and that of its successor DARPA the In- 
ternational Telecommunication Union (ITU) started developing the packet- 
switched network standards. In 1976 the ITU standard was approved as 
X.25, and provided the basis of the international and public penetration of 
packet switched network technology. Using the X.25 and related standards, 
a number of industrial companies created their own networks. The most no- 
table was the first international packet-switched network, referred to as the 
International Packet Switched Service (IPSS). In 1978 IPSS was launched in 
Europe and the US with the collaboration of the British Post Office, Western 
Union International and Tymnet. By 1981 it covered Europe, North Amer- 
ica, Australia and Hong Kong. The X.25 standard also allowed the commer- 
cial use of the network, as opposed to ArpaNet, which being a government 
founded project restricted its use to military and academic purposes. 

In the first packet switched networks the network infrastructure itself 
assured reliable packet transfer between hosts. This approach made it im- 
possible to connect different networks with different network protocols. In 
order to overcome this difficulty a novel concept of internetwork protocol, the 
TCP, was developed. With TCP the differences between different network 
protocols were hidden and the hosts became responsible for the reliability of 
the data transfer. The first specifications of TCP were given in 1974. After 
several years of development and testing the TCP standards were published 
in 1981. This paved the way for the current Internet. Since then every sub- 
net of the Internet has adopted TCP. A detailed introduction to the protocol 
will be presented in the next section. 

The pure network infrastructure would have been useless without user 
applications. The basis of many early Internet applications was Unix to 
Unix Copy (UUCP), developed in 1979. The most notable services using 
UUCP were electronic mail, Bulletin Board Systems (BBS) and Usenet News. 
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At the dawn of the Internet era the most important service was, without 
doubt, email. Most of the early Internet traffic was generated by emails, but 
even in recent years email constitutes a significant share of Internet traffic. 
BBS and Usenet services were popular among home users with slow modem 
connections who did not have direct Internet connections. Messages, news, 
articles, programs or data could be uploaded and/or downloaded after the 
user dialed into a server. BBS and Usenet servers then periodically exchanged 
data via UUCP. 

By the beginning of the 1990's BBSs and Usenet had declined in im- 
portance, mainly due to the new information medium, the World Wide 
Web (WWW). The WWW was born of the merging of the Internet and 
the paradigm of hypertext in the European particle physics laboratory, the 
CERN. The WWW started conquering the Internet after the debut of the 
Mosaic web browser in 1993. 

The Mosaic browser was such an enormous success that it even affected 
the development of the Internet itself. The Internet crossed the borders of the 
academic and industrial research domain and opened up to the wider public. 
The process was accelerated by rapid technological advances in computer 
technology that made personal computers a part of people's everyday lives. 
The combined effect of the above led to the Internet boom in the 1990's, 
when a whole new industry formed around the Internet. 

By now the Internet has expanded even further than computers. Internet 
telephony (Voice over IP), mobile Internet (GPRS, UMTS), web cameras, 
wireless networks, personal digital assistants (PDAs), and sensor networks 
are a few examples of the current trends. The new technologies make both 
the structure and the traffic of the Internet more and more complex. I review 
these issues in the following. 

1.1.2 The structure of the Internet 

Since the development of the Internet was not regularized by any central au- 
thority and it has been influenced by a number of random effects the structure 
of the network is highly irregular. Nevertheless, the Internet can be divided 
into smaller segments, called Autonomous Systems (ASs). Each AS is ad- 
ministered by a separate organization, e.g. a university, an Internet Service 
Provider (ISP), or a government, and is usually organized in hierarchical, 
tree-like structure. ASs are connected to one another via the Internet back- 
bone. The Internet backbone is built from high capacity links, currently up 



1.1. THE INTERNET 



5 



to a couple of lOGbps. On the other end of the hierarchy end users connect 
to the network. The available bandwidth for end users can be in the range 
of 56Kbps modems to 20Mbps business ADSL. If we consider AS as the unit 
of the network, and interconnections between them as links, then we speak 
of AS level topology. 

Internet also can be viewed on a much smaller scale consisting of two basic 
components: nodes and links. Nodes are devices, e.g. computers, cell phones, 
PDAs, routers, switches or hubs, and links are connections between them, 
e.g. cable (Ethernet, optical fiber), radio (WiFi, Bluetooth), infrared (IrDA), 
or even satellite connections. Those nodes which have multiple connections 
must decide in which direction they forward the through traffic. These nodes 
are usually referred to as routers. This detailed view is called the router level 
topology. 

Internet topology has been studied both on AS \4, |5fl and router level 
@, 0, 0, @|- On both level the Internet can be modeled as a graph from 
graph theory. One of the most fundamental quantities used for describing 
the structure of a graph is the degree sequence, which is to say the number of 
the neighbors of nodes. It has been found that the distribution of the degree 
sequence follows a power law P{k) ~ k~ s on both level. The appearance of 
a power law indicates the scale-free nature of a particular object, so a graph 
the degree distribution of which follows power law is called scale-free graph. 
Note that in recent years the statistical properties of other scale-free networks 



have been investigated by the physics community as well [ICJ, lUl, LL2I, LL3 
Examples of such networks vary from social interconnections and scientific 
collaborations [1J] to the WWW [15 . 

Several projects have been launched over the past decade in order to 
map the Internet topology. For example, the Macroscopic Topology Mea- 
surements project of CAIDA, a research group located at the University of 
California San Diego, surveys the Internet continuously with probe pack- 
ets from a couple of dozen monitoring hosts. The visualization of the AS 
level map produced by CAIDA is shown in Fig. 11.11 Rocketfuel is a Internet 
mapping engine, developed at the University of Washington, which aims at 
discovering ISP router level topologies The engine makes use of rout- 
ing tables to focus measurements to certain ISPs, exploits the properties of 
Internet Protocol (IP) routing to eliminate redundancy, and uses data from 
nameservers in order to classify routers. 

It should be noted that the known Internet topology is only a sample 
of the real one. The surveyed topology is obtained from measurements, 
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Figure 1.1: Internet AS level topology collected between 4-17 April 2005 by 



CAIDA 16]. The angular position of nodes corresponds to the geographical 
longitude of the AS headquarters. The radial position is calculated from the 
out-degree of ASs. 



mostly via a program called traceroute. The program can discover routes 
between the traceroute source and given destination hosts. Since the num- 
ber of sources is limited only a section of the real network can be visible in 
one experiment. It is therefore questionable whether the observed topology 
resembles the actual Internet topology. Recently it has been shown that 
a traceroute-based experiment can produce strong bias towards scale-free 
topology [131 , especially when the number of sources is one or two. Moreover, 
it has been shown that a badly designed measurement can show scale-free 
topology even if the original network is regular 1 1 QQ . 



1.1.3 Traffic on the Internet 



The properties of the Internet traffic are as important as the structure of 
the network itself. Since the time-scale of the evolution of the network in- 
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frastructure is much larger than the time-scale of the traffic flow the network 
infrastructure can be considered as a static background behind the traffic dy- 
namics. In comparison with the changes in the network traffic the dynamic 
changes in the network structure can be neglected. 

The Internet traffic is governed by communication protocols, which can 
be classified into separate abstract layers according to their functionality. 
Each layer takes care of one or more separate tasks of data transfer and 
handles data towards a lower or an upper layer. User applications usually 
communicate with the topmost layer, whilst the lowest layer deals with the 
physical interaction of the hardware. 

The most important classification regarding the Internet is the TCP/IP 



protocol suite [20|, l21|, which includes five or four layers. A more general 
and detailed model is the OSI model, which includes seven layers. The 
concept of layers is quite important, since it provides transparency for user 
applications in a very heterogeneous environment. In order to overview the 
mechanisms behind the Internet traffic let us introduce the four-layer model 
of the TCP/IP suite: 

• The topmost, fourth layer of TCP/IP suite is called Application layer. 
It provides well-known services such as TELNET, HTTP, FTP, SSH, 
DNS, and SMTP. User programs should provide data to an application 
layer protocol in a suitable format. 

• The next layer is the Transport layer, which is responsible among other 
things for flow control, error detection, re-transmission, and connection 
handling. The two most important protocols in this layer are TCP and 
User Datagram Protocol (UDP), which will be discussed in more de- 
tail in Section II .21 They represent two conceptually quite different 
transport mechanisms: TCP provides reliable, connection-based data 
transfer, while UDP serves as an unreliable, connectionless, best effort 
transport mechanism. Other protocols at this layer are SCTP devel- 
oped for Internet telephony, and RTP designed for real-time video and 
audio streaming. 

• The following layer is referred to as the Internet layer. This layer 
solves the problem of addressing and routing of packets. The IP and 
the obsolete x.25 protocols reside in this layer. IP hides the details of 
the network infrastructure, and allows the interconnection of different 
network architectures. 
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• Finally, the lowest layer of the TCP/IP suite is the Network access 
layer, which handles physical hardware and devices. Notable examples 
on this layer are the ethernet, WiFi, and modems. 

In order to understand the workings of the Internet, let us take the ex- 
ample of a typical Internet application: let us suppose that Alice wants to 
download a file from Bob. Since Alice wants to get an exact copy of the file, 
she starts an FTP session. First, the FTP protocol builds a connection be- 
tween the two computers. Then the file is split into small datagrams, which 
are passed on to the TCP protocol on Bob's computer. The TCP protocol 
adds a header to the datagrams, including a sequence number, a timestamp, 
and some other information which ensures reliability. Then TCP passes the 
datagrams on to the IP protocol, which adds its own header. The IP header 
contains addressing information. The resulting IP packet is put into the out- 
going queue of Bob's computer. If the queue is empty, then the packet is sent 
to the Network Interface Card (NIC), otherwise it has to wait until the pre- 
ceding packets have been served. The NIC card disassembles the packet into 
ethernet frames and puts them onto the physical cable. The frames travel 
to the default router in Bob's network and the router's NIC assembles them 
back into an IP packet. Based on the destination address in the IP header, 
the router decides in which direction the packet should be forwarded and the 
packet is put into the outgoing queue of the corresponding direction. The 
packet is then disassembled and transferred again over the next cable. The 
procedure is repeated until the packet arrives at its final destination. The 
actual method of data transfer on the Network access layer can differ from 
the above mentioned ethernet method. If Alice uses a dial-up connection, for 
instance, the last step of the packet's path is over a telephone line via a mo- 
dem. At Alice's computer the IP protocol takes the packet and passes it on 
to the TCP protocol. The TCP acknowledges the packet and inserts it into 
the missing part of the file. Finally, when Alice's computer has received all 
the pieces of the file, the FTP protocol saves the whole file to its destination 
on her computer. 

Although both packet-switched and connection-based data transfer are 
present in the above example, the Internet is called a packet-switched network 
because the Internet layer, which is the fundamental core of the Internet, 
utilizes solely packet-switched technology. Other layers can be either packet 
or circuit-switched. Ethernet traffic is packet-switched, for example, but 
modem traffic is carried through circuit-switched telephone lines. Higher 
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level protocols (e.g. FTP, TELNET, SSH) are usually connection oriented, 
too. 

Let us study the Internet layer in more detail. First of all, packets are 
injected into the Internet layer randomly by higher level protocols at cer- 
tain source nodes. Then packets are served sequentially and forwarded to 
neighboring nodes by routers or, if they have arrived to their destination, 
removed from the network. If a router is busy serving a packet then any 
incoming packet is placed into a buffer and has to wait for serving. If the 
queue in the buffer has reached the buffer's maximum capacity then all in- 
coming packets are dropped until the next packet in the queue is served and 
an empty space becomes available in the buffer. The event when a buffer 
becomes full is called congestion. The above described router policy, called 
drop-tail, is the most wide-spread nowadays. Other router policies are also in 
use. The Early Random Drop (ERD) and Random Early Detection (RED) 
polices, for instance, drop incoming packets randomly before the buffer be- 
comes fully occupied in order to forecast possible congestion to upper level 
protocols. The difference between the two policies is that the drop proba- 
bility depends on the instantaneous queue length in the former case and the 
average queue length in the latter. It is possible to give priority to certain 
packets in order to provide Quality of Service (QoS) for certain applications, 
but routers usually serve packets in First In, First Out (FIFO) order. The 
serving rate of packets depends on the actual packet size and the bandwidth 
of the link after the buffer. Packets obviously suffer propagation delay during 
their delivery, is a consequence of two factors: link and from queuing delay. 
The former is constant for a given route, but the later varies randomly with 
queue lengths along the packet's path. 

The product of the link delay and link capacity, in short the bandwidth- 
delay product, equals the number of packets that a link can transfer simulta- 
neously. If this quantity is large compared to the buffer size then the constant 
link delay is the dominant constituent of the propagation delay. Wide Area 
Network (WAN) links are typical examples of this. On the other hand, if 
the bandwidth delay product is small compared to the buffer size then the 
varying buffering delay is the dominant component. Such links can be found 
in Local Area Network (LAN). We will see later that the two scenarios induce 
different TCP dynamics. 

It is evident that queuing theory plays an important role in the mod- 
eling of packet-switched networks in general and the Internet in particular. 
However, queuing theory has been developed much earlier than the advent 
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of packet-switching technology. The first motivation and important applica- 
tion of queuing theory was actually a circuit-switched network, the classical 
telephone system. 

The properties of two quantities, namely the inter-arrival and the ser- 
vice times of customers, affect the behavior of queuing systems most fun- 
damentally. Other quantities, e.g. the size of the customer population, the 
number of operators, the system capacity etc., also have an impact on the 
behavior of the system, but they do not affect the essential properties of the 
queuing system. Both the inter-arrival and the service time series can be 
modeled by discrete time stochastic processes. It is usually assumed that 
both the inter-arrival and the service times are independent and identically- 
distributed (IID) random variables. Furthermore, in the most simple case, 
both inter-arrival and service times are memoryless processes, that is they 
are exponentially distributed random variables. This model is called Pois- 
son queue, since both the number of arrivals and the number of departures 
in a finite time interval follow Poisson distribution. Poisson queues have 
been studied extensively and they proved to be excellent models of telephone 
call centers and telephone exchange centers. Most of the arising questions 
regarding Poisson queues have been answered analytically (22| . 

Internet traffic has been analyzed on various layers of the above TCP/IP 
suite. In a pioneering work by Leland et al. 23] the authors collected and 
studied several hours of ethernet traffic with 20-100/is resolution. They 
found that autocorrelations in the captured traffic decayed slower than ex- 
ponential, that is the system has long-range memory. This result indicated 
problems with Poisson queuing models for packet-switched networks, since 



24 



in a Poisson queuing system autocorrelations would fall exponentially 
Furthermore, it has been shown that the time series of the aggregated Eth- 
ernet traffic is statistically self-similar, and has fractal properties. Paxson 
and Floyd [25| studied the usability of Poisson models for application layer 
protocols and the corresponding IP traffic. They found that, though the traf- 
fic followed a 24-hour periodic pattern, Poisson processes with fixed arrival 
rates are acceptable models for user initiated sessions (FTP, TELNET) for 
intervals of one hour or less. For machine initiated sessions (SMTP, NNTP), 
however, the Poisson model failed even for short time-scales. Furthermore, 
packet level traffic deviated considerably from Poisson arrivals as well. Sim- 



ilar evidence has been found in WWW traffic [26J. Furthermore, it has been 
shown that the distribution of the packet inter-arrival times follows power 
law. Feldmann et al. [27| have presented the wavelet analysis of WAN traffic 
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samples captured around the birth of the World Wide Web between '90 and 
'97. It has been found that as WWW traffic started dominating the network 
traffic gradually different scaling behavior appeared in short- and long-time 
scales. The authors concluded that TCP dynamics might be responsible for 
short-time scaling and application layer traffic characteristics for long-time 
scaling. 

All the above properties are in strong contrast with the properties of the 
Poisson queuing systems, e.g. telephone networks, where both the correla- 
tions and the inter-arrival time distribution decay exponentially. It implies 
that well developed classical models, which provide excellent descriptions of 
circuit-switched traffic, are essentially useless for the description of the In- 
ternet. New traffic models, which provide realistic synthetic traffic, were 
required. A few important traffic models of the Internet will be presented in 
Section E2 

There are several theories which explain the origins of the observed long- 
range dependent traffic. One explanation can be that the observed traffic is 
the superposition of individual effects which happen on separate network lay- 
ers and on very different time-scales; from several minutes of user interaction 
through a couple of seconds of application response until the microsecond- 
scale of network protocol operation. Further assumptions are that heavy- 



tailed file size distribution [26|, |28|, |29j], or heavy-tailed processor time dis- 



tribution is behind the phenomena. There has also been some debate on 
whether the TCP protocol in itself is able to generate long-range dependent 



traffic 30j or not |3JJ] . The TCP's exponential backoff mechanism is also a 



possible source of heavy-tailed inter-arrival times 
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1.2 Data transport mechanisms 

The Internet is an enormous data highway between computers, where data 
packets play the role of vehicles and links serve as the road system. As on 
normal highways, congestions can form at bottlenecks if the capacity of a 
junction is exceeded by the traffic demand. 

The dynamics of the Internet traffic is governed by protocols of the Trans- 
port layer. Protocols on this layer control directly the injection rate of IP 
packets into the network. Almost all the Internet traffic is governed by two 
protocols, namely the TCP and the UDP. Therefore, understanding the op- 
eration of these protocols is very important from the point of view of traffic 
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modeling. For example, fundamental questions are how distant hosts uti- 
lize the network infrastructure and whether they can cause persistent traffic 
congestion or not. 

The performance of the network can be severely degraded as a result of 
persistent congestion. Congestion should therefore be avoided. Just such a 
congestion collapse did indeed occur in 1986 in the early Internet, when the 
useful throughput of NFSnet backbone dropped three orders of magnitude. 
The cause of this collapse was the faulty design of the early TCP. Instead of 
decreasing the sending rate of packets after detecting congestion, the early 
TCP actually started retransmitting lost packets, which led to an increasing 
sending rate and positive feedback. 



1.2.1 The User Datagram Protocol 

UDP is a very simple protocol, which provides a procedure for applications 
to send messages to other applications with a minimum of protocol mecha- 
nism [33||. Neither delivery nor duplicate protection is guaranteed by UDP. 
Furthermore, no congestion control is implemented in it either. UDPrealizes 
an open-loop control design, that is no feedback about a possible congestion 
is processed. 

The principal uses of UDPare the Domain Name System (DNS), stream- 
ing audio and video applications (e.g. VoIP, IPTV), file sharing applications, 
the Trivial File Transfer Protocol (TFTP), and on-line multiplayer games, 
to name a few. 

Since UDPlacks any congestion avoidance and control algorithm, appli- 
cation level programs or network-based mechanisms are required to handle 
congestion. In streaming applications, for example, users are often asked 
for the bandwidth of their access link, and UDPpackets are sent with the 
corresponding fixed rate. Since UDPdoes not have any feedback mechanism 
congestion collapse of the network due to UDP network overload is unlikely. 
However, aggressive network utilization should be avoided, because it can 
block other protocols, mainly TCP. 



1.2.2 The Transmission Control Protocol 

The TCP protocol is complementary to the UDPprotocol in many sense. 
Contrary to UDP, TCP is connection oriented, it guarantees in-order delivery 
and duplicate protection, congestion control and avoidance. In addition, 
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TCP is a closed-loop design which can process feedback from packet delivery. 
Accordingly, TCP is a much more complex design than UDP. In this section 
we present an overview of TCP. 

Among the applications using TCP are the WWW, email, Telnet, File 
Transfer Protocol (FTP), Secure Shell (ssh), to name a few. Since these 
applications are responsible for most of the current Internet traffic TCP is 
the most dominant transport protocol at the moment. Accordingly, under- 
standing the workings of the TCP protocol has great importance in traffic 
modeling. 

Since TCP is connection oriented, it does not start sending data im- 
mediately, like UDP. Rather it uses a three-way handshake for connection 
establishment. If the connection establishment phase is successful the data 
transfer phase follows. Finally, when all the data has been sent, the con- 
nection is terminated in the final phase. The connection establishment and 
termination phases are usually short and involve only negligible amount of 
data compared to the data transfer phase. I will therefore focus solely on the 
main phase, neglecting the other two phases. 

In the data transfer phase the TCP receiver acknowledges every arrived 
packet by an acknowledgment packet (ACK) . The ACK contains the sequence 
number of the last data packet arrived in order. If a data packet arrives out 
of order, then the receiver sends a duplicate ACK, that is an ACK with the 
same sequence number as the previous one. Duplicate ACKs directly notify 
the TCP sender about an out-of-order packet. 

If all the packets are lost beyond a certain sequence number, then du- 
plicate ACK cannot notify the sender about packet losses. In order to re- 
cover from such a situation, the TCP sender manages a retransmission timer. 
The delay of the timer, the retransmission timeout (RTO), is updated after 
each arriving ACK. The TCP sender measures the round-trip time (RTT), 
the elapsed time between the departure of a packet and the arrival of the 
corresponding ACK. The updated value of the RTO is calculated from the 
smoothed RTT, and the RTT variation as defined in 34 . 



Packets are acknowledged after RTT time period from packet departure 
if the transmission is successful. The data transfer would be very inefficient 
if the TCP sender waited for the ACK of the last packet before it sent the 
next packet. On the other hand, sending packets all at once would cause 
congestion. In order to reach optimum performance without causing conges- 
tion, TCP manages two sliding windows with the associated variables. On 
the sender side the congestion window (cwnd) limits the allowed number of 
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unacknowledged packets. This way a cwnd number of packets is transmitted 
on average during a round-trip time period. Since cwnd is used directly for 
congestion control it is changed dynamically. 

The other variable, the receiver's advertised window (rwnd), is managed 
on the receiver side. Rwnd is the size of a receiver buffer which can store 
out-of-order packets temporally. The value of rwnd is included in every ACK, 
though it usually does not change. Although the limit of the unacknowledged 
packets is the minimum of cwnd and rwnd, the later is usually large enough 
not to affect data transfer in practice. Rwnd therefore plays a much less 
important role than cwnd. For the sake of simplicity I will assume that rwnd 
equals infinity. Accordingly min( cwnd, rwnd) will be replaced with cwnd in 
all the equations below where applicable. Let us keep in mind, however, that 
this is an approximation. 

Internet's packet-switched technology implies that there are no reserved 
resources for TCP. This approach is also called best effort delivery. Moreover, 
the Internet lacks any central management authority. Accordingly, TCP does 
not have precise information about its fair share of the network bandwidth 
in the ever-changing network conditions. In the previous section we have 
seen that buffers are able to store excess traffic temporarily, but pockets are 
dropped when a buffer becomes full. Flow control, the alteration of rate at 
which packets are sent in order to get a fair share of the network bandwidth 
without causing severe congestion, is one of the most important tasks of the 
TCP. This goal is achieved by the continuous adjustment of the congestion 
window and eventually the rate at which packets are sent. 

Several TCP variants have been developed in recent years in order to 
enhance its performance in different environments These variants differ 
mainly in the congestion avoidance algorithm. The core concept, however, 
is the same in all TCP variants and has not changed significantly since its 
first specification in 1974. The classical TCP variants (e.g. Tahoe, Reno) 
try to find the fair bandwidth share by the following method: for every 
successfully transmitted and lost packet they increase and decrease their 
sending rate, respectively. This method is based on the observation that a 
packet loss is most likely the result of a congestion event. Note that these 
TCP variants obviously cause temporary congestions in the network in the 
long run. More recent variants often try to detect upcoming congestions 
beforehand via explicit congestion notifications (ECN) from routers or by 
detecting increasing queuing delays from RTT fluctuations (e.g. Fast TCP). 

I discuss the Reno TCP variant in more detail below, since currently this 
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Figure 1.2: Schematic plot of the development of the congestion window. The 
hypothetical network can handle 20 packets simultaneously, denoted by the 
dotted line. Cwnd might overrun this limit, because congestion is detected 
only after RTT latency. Note the small plateaus both in the slow start and 
congestion avoidance phase, which are due to the bursty arrival of packets. 

is the most widespread variant in use. Its congestion control mechanism 
includes the following algorithms: slow start, congestion avoidance, fast re- 
covery, and fast retransmission In figure [L2l the schematic development 
of cwnd due the above congestion control algorithms is shown. There are 
two slow start periods at the beginning of the plot. This is possible due to 
the wrong initial estimate of the slow start threshold (ssthresh). After the 
value of ssthresh has been set to approximately half of the maximum window 
the fast recovery, fast retransmission (FR/FR) algorithms are able to take 
care of the upcoming packet losses. Note the small steps both in the slow 
start and the congestion avoidance phase. The steps are due to the bursty 
departure of packets. 

Slow start and congestion avoidance 

The core of the TCP congestion control mechanism is the slow start and 
the congestion avoidance algorithms. A state variable, the ssthresh, is used 
to determine whether the slow start or the congestion avoidance algorithm 
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is used to control data transmission. When cwnd exceeds ssthresh the slow 
start ends, and TCP enters congestion avoidance. Ssthresh is recalculated 
when congestion is detected by the following formula: 

ssthresh = max(cwnd/2, 2). (1.1) 

The slow start algorithm is used at the beginning of data transfer to 
probe the network and determine the available capacity. Slow start is used 
after repairing losses detected by the retransmission timer as well. In slow 
start phase TCP begins sending at most two packets, which is a "slow start" 
indeed. Despite what the name might suggest, however, the growth of the 
packet sending rate in this phase is quite fast actually: the cwnd is increased 
by one for every ACK. This way the sending rate is doubled in every RTT, 
which means exponential growth in time. 

In congestion avoidance phase cwnd is increased by one every RTT pe- 
riod. This implies linear growth in time, which is a much more moderate 
development than the exponential growth in slow start. One common ap- 
proximating formula for updating cwnd after every non-duplicate ACK is: 

cwnd — > cwnd H -. (1-2) 

cwnd 

This formula is not precisely linear in time, but the advantage of this formula 
is that no auxiliary state variable is required for its application. 



Fast retransmit and fast recovery 

The packet sending rate is reduced drastically at the beginning of each slow 
start phase. Although the slow start algorithm restores cwnd to ssthresh 
at an exponential rate, its application might cause unnecessary performance 
deterioration. In order to circumvent slow start algorithm when possible, fast 
retransmit and f ast recovery algorithms were introduced to the Reno version 
of TCP in 1990 



37 



The fast retransmit algorithm uses the arrival of three duplicate ACKs 
as an indication that a packet has been lost. After the arrival of the third 
duplicate ACK the sender retransmits the missing segment without waiting 
for the retransmission timer to expire. TCP does not enter slow start after 
fast retransmission, but instead starts the fast recovery algorithm. Skipping 
slow start is possible because each duplicate ACK indicates that a packet 
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has been removed from the network. Therefore, newly sent packets do not 
stress the network further. 

After fast retransmission ssthresh is set according to Eq. (11.10 . In addi- 
tion, cwnd is halved, 

7 cwnd . . 

cwnd ^ — - — (1-3) 

and for each duplicate ACK a new segment is sent if possible. After the 
first non-duplicate ACK cwnd is set to ssthresh again, and TCP returns to 
congestion avoidance. Note that slow start might be forced when cwnd is 
small and duplicate ACKs are not accessible. Furthermore, if more than 
one packet is lost within one RTT time period, then the FR/FR algorithms 
may not recover from the loss either, and TCP can enter slow start algorithm 
instead. However, if the packet loss rate is low and cwnd is large enough, then 
the slow start algorithm is used only at the beginning of the TCP session, 
and cwnd is governed in an additive increase, multiplicative decrease (AIMD) 
manner by the congestion avoidance and FR/FR algorithms, respectively. 

The idea behind the AIMD rule comes from the following simple control 
theoretical arguments [Isl, Hsl]. In general, the control of the Ath TCP's 
cwnd can be given by w\(t i+ i) = f(w\(ti),y(ti)) where f(w,y) is the control 
function, which depends on the feedback (e.g. an ACK) from the system 
y(ti) G {—,+}, and the last value of the window W\(ti). The feedback is 
binary: + and — indicates whether to increase or decrease traffic demand, 
respectively. If we restrict our study to control functions, which are linear in 
W\(ti), then we obtain 

w x (t i+ i) = a y(u) + b y(ti) w x (ti), (1.4) 

where the coefficients a± and b± are constants. It is obvious that the con- 
trol equation (II .41) is additive if b± = 1, and multiplicative if b± ^ 1. The 
most important special cases of the possible control algorithms are collected 
in Table 11.11 A feasible control algorithm must satisfy two important crite- 
ria: convergence to efficiency and fairness. Efficiency in this context means 
maximum possible usage of the available resources and fairness means equal 
share of the bottleneck capacity. These criteria give constraints on the coeffi- 
cients a± and b±. It has been shown in (39| that the convergence to efficiency 
and fairness is provided by the constraints a + > 0, b + > 1, and a_ = 0, 
< b- < 1. Moreover, it has been shown that the convergence is fastest, 
when b + — 1. Therefore, the additive increase, multiplicative decrease con- 
trol, which is implemented in TCP, is the optimal control algorithm. 
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b+ = l 
a + > 


b + > 1 
a + = 


6_ = 1 
a_ < 


Additive increase 
Additive decrease 


Multiplicative increase 
Additive decrease 


< 6 < 1 
a_ = 


Additive increase 
Multiplicative decrease 


Multiplicative increase 
Multiplicative decrease 



Table 1.1: Possible control algorithms with a linear control function. 



The backoff mechanism 

Normally in slow start or in congestion avoidance mode, the TCP estimates 
the RTT and its variance from time stamps placed in ACKs. In some cases 
the retransmission timer might underestimate RTT at the beginning of the 
data transfer, and the retransmission timer might expire before the first 
ACK would arrive back to the TCP sender. In order to avoid the persistent 



expiration of the retransmission timer the so-called Karn's algorithm [401 ] 
is applied. According to the algorithm, if the retransmission timer expires 
before the first ACK would return, then the value of the RTO is doubled. 
If the timer expires again, then the timer is doubled repeatedly a maximum 
six consecutive times. Since there is a definite ambiguity in estimating RTT 
from a retransmitted packet the ACKs of two consecutive sent packets should 
arrive back successfully in order for the TCP to estimate the RTT again and 
go back to the slow start mode. 

A similar situation might occur if the packet loss rate is high. In that 
case, consecutive packets can be lost and the TCP might enter the backoff 
state, even if RTT might actually be smaller than the retransmission timer. 
Since the delay between packet departure is doubled, the effective bandwidth 
is halved after each backoff step. TCP can reduce its packet sending rate with 
this method below one packet per RTT. 



Chapter 2 

Traffic dynamics in infinite buffer 



In this chapter I study the TCP dynamics on an idealized single buffer net- 
work model where the probability that a packet is lost at the buffer is negli- 
gible compared to other sources of packet loss. The case of a semi-bottleneck 
buffer when the size of the buffer is limited will be discussed in Chapter [3l 
First, I introduce the important fluid approximation of TCP congestion win- 
dow dynamics in Section 12 . 1L In recent years many aspects of the TCP 
congestion avoidance phase have been clarified. The most important results 
of the literature are reviewed in Section [221 I define the network model under 
study in Section [231 My results on the analytic study of the TCP congestion 
window dynamics are presented in Section 12.41 The discussion of the model 
is given in Section 12.51 Finally, I summarize my results in Section 12.61 



2.1 The fluid approximation 

The equations of motion (ll.ip - fll.3l) are defined at ACK arrivals. The state 
variables are therefore changed in discrete steps at discrete time intervals 
(Fig. O), often referred to as "in ACK time". Note that "ACK time" dy- 
namics is an essential, inherent property of TCP, because it is defined in the 
TCP design and does not depend on the network environment where TCP 
is used. 

In practice the discrete-time equations of TCP dynamics can be approx- 
imated very well by continuous-time equations. Between two consecutive 
packet losses the congestion window is changed according to the fluid "ACK 
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time" equation (II .2ft . that is 

dW 1 



dt ACK W 



(2.1) 



Since the arrival of ACK packets is not uniform in time, the ACK and 
real time averages of important quantities, for instance the throughput, are 
usually different. It would be difficult and rather impractical to transform 
the dynamics of the state variables from ACK to real time exactly. A usual 
approximation is that the arrival rate of ACKs is estimated by the number 
of packets in flight, that is the congestion window W divided by the round 
trip time R: 

dt^CK W 

-ir = -R< (2 - 2) 

From the above equations one can obtain 

" W 1 (2.3) 



dt R 

which is the fluid approximation of the congestion window dynamics in real 
time. Although this real time approximation of TCP dynamics is often suf- 
ficient, I will point out its defects. I will also present a roundabout solution 
to the problems based on the fundamental "ACK time" dynamics of TCP. 

As a simple example of the fluid model let us calculate the average 
throughput, the transmitted data per unit of time, of a single TCP over 



a lossy link 41]. Let us suppose that the round-trip time is constant and the 
packet loss is deterministic. Considering these assumptions the congestion 
window changes at a constant rate between consecutive packet loss events 
as (1 2 . 3 f) . The window is halved after each packet loss event. The window 
evolution shown in Fig. 12.11 is therefore a periodic sawtooth in the interval 
[RW m /2, RW m ] and in the range of [W m /2,W m ]- The length of a cycle is 
RW m /2. The number of transmitted packets in a cycle equals the integral 
of the congestion window for one period: N = fW- 7 ^. Since in each cycle 
one packet is lost, the packet loss probability can be expressed as p — 1/N. 
Therefore, the average throughput X can be given by 
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2W m R 



Figure 2.1: The TCP congestion window evolution under deterministic 
packet loss and constant round-trip time. The congestion window W varies 
linearly between the maximum value W m and its half, W m /2. 



where P is the size of the data packets and c 



is a constant. The re- 



sulting formula, often referred to as the "inverse square-root law", expresses 
the impact of a network on TCP dynamics. The formula establishes a con- 
nection between throughput, an important characteristic of TCP, and packet 
loss probability, an attribute of the network on which TCP operates. The 
formula becomes inaccurate for large p, because multiple packet losses, which 
force TCP into the neglected slow start phase, are more probable in this case. 
The effect of multiple losses on different TCP variants is diverse, so the va- 
lidity range of the formula depends on the TCP variant under consideration. 



2.2 Preliminary results of traffic modeling 
2.2.1 Single session models 

The simple model given above can be extended considerably in a number of 
aspects. In a paper by Altman et al. |42j the TCP throughput for generic 
stationary congestion sequence was studied. The model extends the previous 
deterministic loss model to arbitrarily correlated loss sequences. The model 
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is based on the following difference equation 

X n+1 = aS n + (3X n , (2.5) 

where X n is the value of the throughput just prior to the arrival of loss 
signal at T n , S n = T n+ i — T n is the time interval between consecutive losses, 
and a and (3 are the linear growth rate and multiplicative decrease factor, 
respectively. From the time average of the throughput the following loss 
formula was derived 



X 



1 i a i 00 

+ ^C(0)+J2P k C(k), (2.6) 



R^/p\2(l-(3) 2 



fe=i 



where C(k) = (E [S n S n+ k] — E [S n ] 2 ) /E [S n ] 2 is the normalized autocorre- 
lation function of the loss interval process (S n ) n€N . The derived formula, 
applied for uncorrelated Poisson process with C(k) =0 and (3 = 1/2, pro- 
vides the same result as ( 12.41) . A correlated loss interval scenario was modeled 
with Markovian Arrival Process and the average TCP throughput was ex- 
pressed with the infinitesimal generator of the arrival process. Furthermore, 
the authors derived bounds for the throughput in case of limited congestion 
window evolution and discussed the effects of timeouts. 

Padhye et al. [43| have studied the steady-state throughput of TCP Reno 
when packet loss is detected via both duplicate ACKs and timeouts, and 
the throughput is limited by the receiver's window in more detail . The 
probability of timeout was estimated by the packet loss probability and the 
congestion window. It was shown that for small packet loss the timeout 
probability can be approximated by min(l,3/PF) and a very comprehensive 
loss formula has been derived. 

The performance of two classic TCP versions, namely Tahoe and Reno, 



has been analyzed by Lakshman and Madhow 44] when the bandwidth- 
delay product of the bottleneck link is large compared to the buffer size. The 
authors estimated the average throughput for both slow start and congestion 
avoidance phases with deterministic and independent random losses. 

In a paper by Ott et al. [45| the stationary probability distribution of the 
congestion window was calculated for constant packet loss probability. The 
authors mapped the "ACK time" point process to a continuous "subjective 
time" process by the mapping W{t) = y/pW^t j , where both the time and 

the state space of the discrete process is rescaled in order to obtain a well 
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behaved process. It was shown that for p —>■ the rescaled process W(t) 
behaves as 

dW(t) a .„ . , w , 

W (t + ) = (3W (t~) ift = r k (2.8) 

where Tk are the points of a Poisson process with intensity A, a > and 
< (3 < 1 are the linear growth rate and multiplicative decrease factor, 
respectively, m > 0, and lastly t~ and t + denote the limit to t from the left 
and from the right, respectively. The parameter values for TCP congestion 
avoidance algorithm are (3 = 1/2 and m = 1. 

The stationary complementary distribution function of the process W(t) 
has been given in the following series expansion form: 

Fw(w) = Y,Rk(c)e W (-^—w m+1 j , (2.9) 

where r) = A/a, c = (3 m+1 and for |c| < 1 

1 (_i)fc c -|fc(fc+i) 



R k (c) 



L{c) (1 - c) (1 - c 2 ) ... (1 - c fc ) 

oo 

L(c)=l[{l-c k ). 



(2.10) 



fe=i 



"ACK", "subjective" and real time averages and other moments of the con- 
gestion window were calculated and an inverse square root loss formula was 
derived. 

The model has been extended for state dependent packet loss probabil- 



ity in [46j. State dependent loss models the interaction of the TCP with 
ERD queuing policy, where the packet drop probability is a function of the 
instantaneous queue length. It is also applicable to RED routers, where the 
drop probability depends on the average queue length. An iterative solution 
for the probability distribution function of the congestion window was de- 
rived. The authors found good agreement between the derived distribution 
and computer simulations. 



An in-depth analysis of RED queuing dynamics was presented in 



47 



The time dependent congestion window development was modeled with the 
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stochastic differential equation 

dW ^-^wuc-^f dmt) ' (2 - n) 

where di denote the fix propagation delay of the bottleneck link, C is its 
capacity, q(t) is the queue length at time t and iVj(i) is a Poisson process with 
a rate that varies in time. The above equations were transformed to a system 
of delayed ordinary differential equations in order to obtain the dynamics of 
the expectation of the congestion window. The expectation value of the 
queue length q{t) and the RED estimate of the average queue length x(t) 
were approximated by two further differential equations. As a result, the 
authors obtained N + 2 coupled equations for the same number of unknown 
variables (Wi(t), q(t),x(t)) . The equations were solved numerically and were 
compared to computer simulations. The authors pointed out the importance 
of the sampling frequency of the smoothed queue length estimate. A high 
frequency sampling might cause unwanted oscillations in the system, while 
a low frequency sampling can increase the initial overshoot of the average 
instantaneous queue length. 

2.2.2 Multiple session models 

The above papers considered only a single TCP connection. In real com- 
puter networks, however, a number of TCPs might compete for the network 
resources. In particular the traffic of TCP sources may flow, in a parallel 
fashion, through a common link. Web browsing represents a good example 
of parallel TCP, as up to four parallel TCP sessions are started at each page 
download. 

A possible result of TCP interaction can be that parallel TCPs are syn- 
chronized. The underlying reason for synchronization is TCP's delayed reac- 
tion for congestion events, which keeps drop-tail bottleneck buffers congested 
for about an RTT time period. This temporary congestion can induce fur- 
ther packet losses in competing TCPs. Based on this phenomenon Lakshman 



and Madhow [44j supposed that the congestion window development of par- 
allel TCPs is synchronized in the stationary congestion avoidance regime. 
The authors also took into consideration that the bottleneck buffer of large 
bandwidth-delay product connections can be either under- or over-utilized. 
The congestion window development was therefore split into two phases ac- 
cordingly. The authors found a fixed point solution of both the duration 
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and the average congestion window of the two phases. Finally, the average 
throughput of each individual connection was estimated from the window 
size divided by the round-trip time. 

TCP synchronization is disadvantageous, since it causes performance 
degradation. However, this effect appears only in drop-tail queuing systems. 
Active queue management, such as RED and ERD, alleviate the problem of 
synchronization. A paper by Altman et al. [48| compared the synchroniza- 
tion model to one in which only one of the parallel TCPs loses a packet at a 
congestion event. The probability that a specific connection is affected was 
proportional to the throughput of the particular flow. This drop policy mod- 
els RED routers. The stationary distribution of the discretized congestion 
window at congestion instants was calculated. The average throughput was 
estimated from the calculated window distribution via a semi-Markov pro- 
cess. The authors compared their results with simulations of a RED buffer 
and found that their asynchronous model surpasses the synchronous model 
presented in 



44 



Another typical effect in multiple TCP scenario is the bias against con- 
nections with long round-trip times This effect is the fundamental con- 
sequence of TCP dynamics, and it is not affected by the queue management 
policy. The phenomenon can be explained qualitatively by the following sim- 
ple arguments. The growth rate of the congestion window is inversely propor- 
tional to the round-trip time R. The average congestion window is therefore 
inversely proportional to R as well. Furthermore, the average throughput 
X can be related to the average congestion window W by Little's law from 
the queuing theory: X = W/R. Therefore, the throughput is approximately 
proportional to l/R a , with a = 2. The exponent obtained from measure- 
ments has been shown to fall in the range 1 < a < 2 due to the queuing 



delay ignored in the above arguments [44 



Floyd and Jacobson [50] have shown that small changes in the round-trip 
time might cause large differences in the throughput of different parallel TCP 
flows. Specifically, packets of certain TCPs can be dropped tendentiously due 
to a phase-effect, causing an utterly unfair bandwidth distribution. Chang- 
ing the relative phase of arriving packets at the bottleneck link by slightly 
modifying the round-trip time can completely rearrange the bandwidth share 
of different TCP connections. Random effects, such as random fluctuations 
in the round-trip time or RED queuing policy, also alleviate the phase-effect. 
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2.2.3 The Network Simulator - ns-2 

New models, algorithms, and analytical calculations should be validated 
against experiments. Without doubt the most authentic data can be ob- 
tained from Internet measurements, but the deployment of a measurement 
infrastructure can be quite expensive and is still very limited. Moreover, 
models often use simplifications which make them difficult to compare with 
real Internet data. Network simulators, on the other hand, provide "labo- 
ratory" environments, where every parameter of the network and the traffic 
can be precisely controlled. Therefore, network simulators are important 
tools in the hands of researchers endeavoring to carry out well controlled 
experiments. 

One of the most widely used network simulators in the research commu- 
nity is the Network Simulator — ns-20 [51]. A short overview of the simulator 
is given next, since several analytic and numeric results of this thesis have 
been validated by ns-2. 

The ns-2 simulator mimics every component of a real network, e.g. links, 
routers, queues, protocols, applications and so on. The network traffic is 
simulated at packet level, which is to say the course of every packet is followed 
from its injection into the network until its removal from it. The packet-level 
simulation of network traffic makes the simulator very realistic, so fine details 
of the simulated network traffic can be observed. The major disadvantage 
of a packet-level simulation is the considerable amount of computing power 
that it requires. 

The ns-2 simulator is event-driven, that is every component might sched- 
ule events into a virtual calendar. The simulator's scheduler runs by select- 
ing the next earliest event for execution. During processing of events further 
events can be scheduled into the calendar. This event-based mechanism can 
also be observed in every part of the simulator, for example in the handling 
of data packets in ns-2. Data packets do not actually travel between virtual 
nodes in the simulator, but rather are scheduled for processing at different 
network elements instead. For example, when a packet is put onto a link for 
transmission the link object in the simulator only schedules the packet for 
the queue of the next node on the other end of the link. 

The core of the simulator has been written in C++, but it also has an 
OTcl scripting programming interface, the object oriented extension of Tel. 
The C++ core offers fast execution of the simulator. However, average users 



The next major version of the simulator, ns-3, is under active development. 
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Figure 2.2: Idealized network model with infinite buffer capacity. Other parts 
of the network are modeled with link delay D, bandwidth C, and packet loss 
probability p. 



do not need to deal with C++ code in order to run simulations under ns-2. 
All the network elements have been bound to objects in OTcl, so complex 
scenarios can be built up simply and easily by writing short OTcl scripts. 



2.3 The infinite-buffer network model 

A very simple model of an access router connected to a complex network 
consists of a buffer and a link, shown in Fig. I2.2L In this regard the link is 
not a real connection between routers, but rather a virtual one. The influence 
of the network on the traffic using the access router is modeled with a few 
parameters of the virtual link: a fixed propagation delay D, bandwidth C, 
and packet loss probability p. This probability represents the chance of link 



and hardware failures [52J, incorrect handling of arriving packets by routers, 



losses and time variations due to wireless links in the path of the connection 



44j, the likelihood of congestion in the instantaneous bottleneck buffer, and 
the effect of RED and ERD queuing policies. In this chapter I assume that 
the buffer is large enough that no packet loss occurs in it. 

Two practically important limits in respect of the role of the access buffer 
are: a) LAN traffic, when the bandwidth delay product of the link is small, 
only a few packets can be out in the link and the buffer is never empty; 
and b) WAN traffic, when the bandwidth delay product of the link is large, 
packets are in the link and the buffer is mostly empty. From now on I will 
refer to systems with small and large bandwidth delay products as LAN and 
WAN, respectively. 

As I mentioned earlier in Section 11.2.21 the Internet traffic is governed 
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mostly by TCP. I will therefore neglect UDP traffic in my simple network 
model and will only study the behavior of TCP dynamics. 

In realistic networks many TCP sources [53| may share the resources of 
the access network. The difficulty of describing the parallel TCP dynamics 
lies in the interaction of individual TCP flows. It is obvious that the number 
of packets in the network injected by one of the TCP sessions affects the 
networking environment of the others. In particular, it contributes to the 
round trip times and packet losses felt by the other TCPs. Since the con- 
gestion window controls the maximum number of unacknowledged packets, 
understanding its distribution is crucial to describe the interaction. 

While an exact treatment of nonlinear interacting systems (such as this 
one) is not possible in general, very efficient methods, motivated mostly by 
interacting physical systems, have been developed. One of the most estab- 
lished methods is the mean field approximation. In this approximation each 
subsystem operates independently in an "averaged" (or mean) environment. 
The average environment is calculated from the behavior of the subsystems. 
Finally, a fixed point of the system has to be found where the "mean" en- 
vironment and the environment averaged over the independent subsystems 
coincide. This way we obtain a self-consistent solution which provides an 
approximate but quite accurate description of each subsystem. 

In the case of computer networks each TCP plays the role of a subsystem, 
while the environment is the round trip time. First, the congestion window 
distribution of a TCP is calculated by assuming a given packet loss and round 
trip time. Next, the mean round trip time is calculated using the window 
distribution. Finally, a fixed point value of the round trip time is determined. 

2.4 Dynamics of a single TCP 

For studying the behavior of interacting TCPs by mean field approximation 
one should know the behavior of a single TCP first. In this section I carry 
out the analysis of a single TCP with the use of the fluid approximation of 
TCP dynamics, presented in Section I2TT1 

Recall that between two consecutive losses the congestion window is gov- 
erned by the continuous time differential equation (12.31) : 



dW 1 

~~d7 ~ R(wy 



(2.12) 
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(a) ns-2 simulation of LAN (b) Ideal fluid model of LAN 

Figure 2.3: Comparison of ns-2 simulations with the fluid approximation of 
the congestion avoidance process of TCP/Reno. Note the small plateaus in 
the simulations due to the FR/FR algorithm after each period. 



where W G [0, oof is the congestion window, and R(W) is the round-trip 
time, which might depend explicitly on the value of the congestion window. 

Consider, for example, a typical LAN scenario with a single TCP where 
the link delay is small and packet delay is caused mostly by buffering. The 
congestion window counts the number of unacknowledged packets, and these 
packets can be found on the link and in the buffer. At each packet-shift time 
unit a packet is shifted from the buffer into the link. The round-trip time 
of a freshly sent packet will be the time it should wait for the shifting of 
all previously sent packets in the system, which is in turn measured by the 
congestion window R(W) = WP/C. 

An idealized congestion window process is shown in Fig. 2.3(b), while a 
simulated congestion window sequence can be seen on Fig. 2.3(a) for com- 
parison. Numerical simulations were executed by Network Simulator (ns-2), 
introduced in Sec. l2.2.3l Note the small plateaus in the simulations after each 
cycle of the congestion window process. These plateaus are the result of the 
FR/FR algorithms. First, I will ignore the effect of the FR/FR algorithms, 
and I will consider their influence later. 

In order to include more general — even hypothetical — TCP dynamics in 
my model the round-trip time is written in the following form: 



R(W) = a~ l W 



(2.13) 
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with m > and a > 0. Note that this notation includes the "ACK time 
dynamics of TCP as well. 

I Oh n Q ti wri + fon nnw etc 

dt W 



The fluid equation 02. 121) can be written now as = p^r, which can be 



rearranged into 

— = or (m + 1). (2.14) 

It is obvious that between two packet loss events the solution of this differ- 
ential equation is 

W m+1 {t) = W m+1 {n) + a (m + 1) (t - n) , (2.15) 

where Tj denote the instant of the i th packet loss. At r, the transformation 

W(t+) = f3W(T~) (2.16) 

is executed, where W{r~) and W(7^ + ) are the congestion windows immedi- 
ately before and after the time of packet loss, and < (3 < 1. The actual 
value of (3 is 1/2 in most TCP variants. 

Let Wi = W(r^) denote the congestion window immediately after the 
i th packet loss, <5j = r i+1 — Tj the length of the time interval between two 
losses, and c = j3 m+1 hereafter. Since TCP is assumed to detect packet losses 
instantaneously in the fluid model, W™^ 1 can be written as 

W? + t l = [f3W(r w )] m+1 = cWr +1 + a (m + 1) cd,. (2.17) 

By repeated application of (12.170 one can show that the value of the 
congestion window immediately after the N th packet loss is 

JV-l 

W% +1 = c N W™ +1 + a (m + l)^2 c k+1 S N -k-i- (2.18) 

k=0 

For N — > oo the initial value Wo becomes insignificant and the sequence of 
congestion window values after the packet losses (W a .i.) can be expressed as 

oo 

W aS m+1 = lim W™ +1 = a {m + 1) V c k+1 5 k . (2.19) 

N—+00 ' J 

k=0 

Note that the indexing of the Si sequence is reversed. This was allowed 
since, as I will show below, every <5« had the same statistical properties. 
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The reversed indexing was necessary since the infinite sum would have been 
meaningless without it. 

In the LAN case one packet is shifted out of the buffer in each time unit. 
Therefore, in the fluid approximation the times between packet losses, 5i, 
are independent exponentially distributed variables. The WAN scenario is 
slightly different. Since there is no queue in the buffer there are periods when 



no packet leaves the buffer (see the small horizontal steps in Figure 2.6(a)), 
packets cannot be lost in those intervals. However, I assume first that times 
between losses are exponentially distributed in the WAN scenario as well — 

f Si (x) = \e-**, (2.20) 

where 1/A is the average time between losses — and I will improve the model 
later. For the LAN case A = pC/P, since one packet is shifted from the 
buffer in P/C packet-shift time. Combining (12.191) and (I2.20p one can obtain 
the distribution of W & \. 



OO OO / \ 

/» / oo \ oo 

■■■ 5 lw-a(m + l)J2 c k+1 x k ]J f 5i (a*) dx t , (2.21) 

where 8() is the delta distribution. In reality only the distribution of generic 
values of the congestion window can be measured. Therefore, their distribu- 
tion has to be derived as well. Here I show that this can be done analytically. 

In general, between losses the congestion window is the sum of two ran- 
dom variables W m+1 = W & \ m + a (m + 1) r, where r is a uniformly dis- 
tributed random variable in the random interval [0, 6i). To obtain the proba- 
bility distribution of the congestion window at an arbitrary moment we have 
to derive the distribution of the random variable r as well. Its distribution 
can be derived as follows: r is distributed uniformly on each interval with 
length p, assuming p is given. This statement can be expressed mathemati- 
cally with the conditional distribution 

fr(t I P = x) = -X[o,x](t), (2.22) 

where Xuiw) denotes the indicator function of set H C R. Furthermore, 
the probability of selecting a random interval is proportional to the length 
of the given interval and the distribution of the random variable Si. The 
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proportional factor can be deduced from the normalization condition of the 
probability distribution: 



x 



■f Si {x) = Xxf 5i (x) 



(2.23) 



f™ x fSi(x)dx E[Si] 
Finally, the desired distribution follows from the total probability theorem: 



fr{t) 



f T (t | p = X)f p {x) dx 



X I Xe~ Xx I 5(t -y)dydx = X I Xe~ Xx dx = Xe 



,-At 



(2.24) 



In order to calculate the distribution function of a general value of the 
congestion window we apply the method of the Laplace transform. As is 
known, the Laplace transform of the density function of the sum of inde- 
pendent random variables is the product of the Laplace transform of their 
density functions. The Laplace transform of (12.211) with respect to W & ,\ m ^~ 
can be defined as f w m+i(s) = J °° e~ sw f w&l m+i(w) dw. This can be easily 
evaluated and we get 



f w , m+i(s) 



n 

k=0 

oo 

n 



A / exp (sa (m + 1) (3 m{k+l) x k - Xx k ) dx k 
X 



(2.25) 



X + a (m + 1) c k s 

k=l 

The Laplace transform of the distribution of a (m + 1) r is given by 



/a(m+l)r( s ) 



A 



a (m + 1) 



"<*(m+l)*e~ S * dt 



X 



X + a (m + 1) s 



(2.26) 



Therefore, the Laplace transform of the generic W m = W a j, m + a (m + 1) r 
distribution is the infinite product 



f W m+i(s) = fw aA m+1 ( S ) fa{m+l)r(s) = JJ 



A 



/c=0 



A + a (m + 1) c k s 



(2.27) 
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Furthermore, we can rewrite the Laplace transform as a sum of partial frac- 
tions 

v ' k=0 a(m+l) T 

where the coefficients hk(c) can be obtained from the residues of the poles of 
fw m {s)'- 

h n t> a (m + 1) ~ 

ftjfc(c) = Res f Wm +i{s) 

Ac- fc A 
a(m+l) 

= l lm "(■"+!) f s + *g \ft A 

a , Ac-fc A V a (m + 1) / J-- 1 - A + a (m + 1) c z s 

a:(m + l) x /=0 

= ^ n i _ c ;-fc = n i3^=i' ( 2 - 29 ) 

(=0 v ' 1=1 

where 

oo 

L(c) = n (1 - c l ) . (2.30) 
i=i 

It follows evidently from this formula that the relative strength of successive 
terms eventually decreases exponentially fast, when k is large enough: 

~J~TY' = ~° k ~\ — ~fe+T ~ ~° k forfc»-J-, (2.31) 
hk{c) 1 — c fe+1 logc 

therefore, only a small number of constants should be used for numerical 
purposes. 

We can perform a term-by-term inverse Laplace transform on (|2.28l) . The 
density function of W m can be given by 

A °° / \c~ k \ 

f wm+ i(w) = — — V/i fc (c)exp —w . (2.32) 

a(m + l)^^ \ a(m + l) J 

The distribution of the congestion window is given by a simple variable trans- 
formation 

fw(w) = -«,- £ fc(c) exp (--^^J . (2.33) 
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Finally, the complementary cumulative distribution F w (w) = fw{ w ') dw' 
can be given by 

OO / , \ 

F w (w) = V c k h k (c) exp ( . (2.34) 

\ a(m + l) J 

Note that the above formulas do not change when A and a are varied, but 
the A/a ratio is kept fixed. Furthermore, the weight of the k th term in the 
probability distribution is c k hk(c), so the error induced by truncating terms 
above a threshold index can be estimated precisely: 1 — J2k=o ° k hk{c) ■ 

Compare the results (12321) - (12311) with - (pUP) ) . The calculation 
has reproduced the results of Ott et al. [45j] with a slight difference in the 
notation. However, I have not supposed in my derivation that p — > 0, as 



Ott et al.l did. Furthermore, I have shown explicitly that r is exponentially 



distributed, which is missing in the previous derivation. 
The moments of W can be calculated from (12.321) as 



A ^ 



E[VT (m+1) ]= / W r f W m +1 ( W )dw = — ; ^ ^Mc) / W r e-^+^ W dw 



a (m + 1) 



A:=0 



a (m + 1) 
X 

a (m + 1) 
X 



x poo 

OO 

T(l+r)J2c {1+r)k h k (c), 



k=0 



(2.35) 



where r > 0. The variable transformation z = Y M , ro was executed 
in the first integral, and the integral definition of the Gamma function 
T(x) = J °° z x ~ 1 e~ z dz was applied in the second equation. If r = n is an 
integer the moments can be given in closed form. For this end let us find 
the series expansion of the Laplace transform of W m+1 . Observe — following 



Ott et al.l — that the product form of the Laplace transform (12.271) satisfies 



the following functional equation: 

H s j f w , n +i(s) = f W m+i(cs). (2.36) 

Differentiate n times both sides of this functional equation with respect to s: 



1 + ^±Hs) + „2>l±il/<»-) ( 3 ) = c"4"i +lM . (2.37) 
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Table 2.1: h k (f3 m+1 ) coefficients with (3 = 1/2 and m = 1 



ho 1.4523536 h 3 -3.2786819 • 10" 2 h 6 1.9643078 • 10" 9 

hx -1.9364715 h A 5.1430305 • 10" 4 h 7 -4.7959661 • 10~ 1: 

h 2 5.1639241 • HT 1 h 5 -2.0109601 • 10~ 6 h 8 2.9272701 • 10 -1 ' 



Since the n th derivative of the Laplace transform at s = is related to the 
moments as E[W n ( m+1 )] = (— l) n /^l(0), we find the following recurrence 
relation: 

(1 - c n ) E[W n{m+1) ] = n Q! ( m + 1 ) E py(n-i)(m+i)] - ( 2 _ 38 ) 

A 

This recursive equation with the initial condition E[W ] = 1 immediately 
yields 

E[w »(nH-D] = n , f ^(m+l) \ n ^ 1_ (2 
^ ' k=l c 

2.5 Discussion 

2.5.1 Local Area Networks 

I will now confirm the validity of the above results by numerical simulations. 
In this section I study the LAN scenario, that is when the bandwidth-delay 
product of the link is much smaller than the size of the buffer. As I have 
pointed out earlier the parameters of the ideal LAN scenario are m = 1, 
c = P 2 = 1/4, and a _1 A = p. The first nine numerical values of h k (c) are 
shown in Table 12.11 It can be seen that the coefficients converge to zero so 
quickly that it is sufficient to keep the first five terms in practical calculations. 

The mean of the congestion window can be calculated from (12.351) with 
the parameter r = ^-j- = |: 

r i /2x/tF <A Wl/4) 1.5269 



which gives the well known inverse square-root formula. The second moment 
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(a) Average vs. mean 
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<r[W\ 

(b) Empirical standard deviation vs. stan- 
dard deviation 



Figure 2.4: Empirical mean and standard deviation of the congestion window 
as the function of the corresponding theoretical values in the range of loss 
probabilities p = 0.1%-5%. 

can be obtained exactly from fl2.39j) with n = 1: 

E[W 2 } = - l — = — , (2.41) 

therefore the standard deviation is approximately 

, n ^70,0 

a[W] = ^E[W 2 } - E[W] 2 « — ]=-. (2.42) 

yP 

In Figure 12.41 the empirical mean and standard deviation of the conges- 
tion window is plotted as the function of the theoretical values ( 12.401) and 
(12.421) . respectively. The model predicts measurement points on the diag- 
onals, shown with dotted lines. We can see that the empirical standard 
deviation agrees well with the theoretical values, but the average congestion 
window is systematically smaller than predicted. The linear fit f(x) = x+b of 
the data points gives an estimate for the average shift b = —1.4141 ± 0.0611. 

The most important source of error is that FR/FR algorithms have been 
neglected in my idealized model, but the simulator does use these algorithms. 
The small plateaus appearing in the congestion window after each cycle pro- 
duce bias towards the smaller window values. 

In a refined model let us consider the FR/FR algorithms as well. Denote 
W the fluid approximation of the extended congestion window process, which 
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can operate in either congestion avoidance (CA) or FR/FR mode. TCP 
remains in FR/FR mode until the ACK of a retransmitted packet reaches 
the sender, that is the round-trip time before the FR/FR mode started: 
R^P^W). Furthermore, the probability that a plateau forms in the window 
interval [w,w + dw] equals the probability that the window is reduced to the 
given interval after a packet loss: fw al (w) dw. The form of the distribution 
fw al (w) might be derived by direct calculation, but it can also be found 
by a simple argument: in stationary state of TCP — since the packet loss 
process is memoryless — packet loss can occur at every congestion window 
value with the same probability, supposing that the window has reached the 
given value. In other words the value of the "after loss" congestion window 
is W a .i. = /3Wb.i. = f3W. Accordingly, the distribution of the "after loss" 
window is /w al (w) dw = fw(l3~ l w) df3~ l w. On condition that TCP is in 
FR/FR mode the probability distribution of the congestion window is 



On the other hand, the window distribution in the congestion avoidance 
mode can clearly be given by fw( w I TCP in CA mode) = fw( w )- Now 
only the probabilities of the CA and FR/FR modes are required. Since each 
congestion avoidance phase is followed by a FR/FR mode, probabilities of the 
different modes are proportional to the average length of the corresponding 
mode. The mean length of a congestion avoidance mode is evidently the 
average time between two packet losses: E[5i] = 1/A. The average length 
of a FR/FR period, on the other hand, is simply the average length of the 
plateaus: E[i2(W)]. This implies that 



f w (w | TCP in FR/FR mode) 



R^w^fwj^w) 
E[R(W)\ 



(2.43) 



P(TCP in CA mode) 



1/A 



and 



(2.44) 



P(TCP in FRFR mode) 



l/\ + E[R(W)Y 
E[R(W)] 



(2.45) 



l/X + E[R(W)]' 



Therefore, the probability distribution of the congestion window extended 
by the FR/FR algorithm is 
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where (12.131) . the definition of R(W) has been substituted. This formula is 
the main result of this section. In Figure l2~5l the histogram of the congestion 
window simulated with ns-2 and (I2.46P are compared in the range of loss 
probabilities p = 0.1%-5.0%. We can see an almost perfect match between 
theory and simulation. In order to illustrate the improvement of the formula 
(12.461) on (12.331) . I plotted fw{w) with dotted lines for comparison. I must 
stress here that there are no tunable parameters in (12.461) and no parameter 
fit has been made. 

Let us calculate the moments of W: 

~ tl E[W k ] + (3 k ±E[W m+k ] 
E[W k ] = — - — y s 1 -. 2.47 

As an important special case we can calculate the correction of the FR/FR 
algorithms to the mean of the congestion window: 

a 1 + v ' 

If the formula (12 .35fl is substituted into the above equation one can obtain the 
dependence of the correction on A/a. Specifically, for m — 1: E[W — W] ~ 
Interestingly, the correction tends to a constant in the small 



0.9981 



1+1.5269^/5 

loss limit: lim A / Q ^o E[W — W] ~ —0.9981. In the range of loss probabilities 
p = X/a = 10~ 4 -5 • 10~ 2 , investigated by simulations, the correction to the 
congestion window average is between —0.8659 and 0.9831, which is less than 



observed in Fig. 2.4(a) The remaining discrepancy comes from the difference 
between the continuous and the fluid value of W. In the simulation the 
congestion window is not only halved its integer part is also taken. This 
discrepancy accounts for approximately —0.5 unit shift on average. The 
slow start mechanism, which becomes more and more dominant as the loss 
probability increases, also makes the small window values more probable. 
However, these effects are beyond the scope of the applied fluid model. 



2.5.2 Wide Area Networks 

I turn now to the WAN scenario, where buffering delay is very small compared 
to the link delay. A typical congestion window sequence is shown in Figure l2~Ul 
with D — Is and p = 0.01. 
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Figure 2.5: Histograms and theoretical distributions of congestion windows 
in LAN. Network parameters are C = 256 kb/s, P = 1500 byte, and D = s. 



40 



CHAPTER 2. TRAFFIC DYNAMICS IN INFINITE BUFFER 





(a) ns-2 simulation of WAN 



(b) Details of the cwnd in the bounded box 



Figure 2.6: The congestion avoidance process of TCP in WAN setup. The 
global congestion window development is seemingly linear, but the detailed 



plot on Figure 2.6(b) shows a different picture. The globally linear growth is 
composed of alternating idle and LAN-like active periods. See the discussion 
in the text. 



The applicability of the developed model depends on two crucial factors: 
the validity of the exponential inter-loss distribution (I2.20P and the validity 
of f)2.13p . the dependence of round-trip time on the congestion window. The 
difficulty of the WAN scenario is that — as I mentioned earlier — there are 
periods when no packet leaves the buffer. This effect corrupts the validity of 
both assumptions. Packets cannot be lost during idle periods, so the inter- 
loss time distribution deviates from exponential distribution. Furthermore, 
if we assume that the round-trip time is constant, R(W) = 2D, then the 
solution of the equation of motion ( 12.151) predicts linear congestion window 
development, which corresponds to m = in the model. However, a typical 



sequence of congestion windows, displayed in Fig. 2.6(b), shows step-like 



growth instead. Another difficulty is that even if the inter-arrival times can be 
approximated with an exponential distribution, the connection between the 
packet loss probability p and the parameter of the distribution A is unknown. 

Given these concerns I approach the congestion window development in 
a WAN network in a manner different from simple linear growth, the model 
applied exclusively in the literature. First of all let us investigate the window 
development in congestion avoidance mode in more detail. The fine structure 



of the congestion window is shown in Fig. 2.6(b) During the active period 
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of TCP, when ACKs are arriving back to the sender (in "ACK time"), the 
congestion window is being increased the same way as in a LAN network. 
The difference from LAN in a WAN scenario is that an idle period follows 
with a constant congestion window. Since W packets are transferred in an 
active period, the length of an active period is W/a. The following idle period 
is 2D — W/a long, because the total length of an active and the succeeding 
idle period is precisely one round-trip time, 2D. 

Let W* denote the congestion window idle periods included. If the 
plateaus corresponding to idle periods are approximated as if they were 
blurred evenly on the active periods, then — analogously to the FR/FR mode 
in LAN — the conditional distribution of the congestion window in idle mode 
of TCP can be formulated as 

2 ^^f w (w)Q(2D-w/a) 
fw*{w | TCP in IDLE mode) = (2.49) 



E 



2 -^e(2D - W/a) 



Only the probabilities of idle, CA and FR/FR modes are required. The 
probability of each mode is proportional to the average time TCP spends 
in the particular mode. Considering the idle mode, the average length of 



a plateau in one ACK increment is E 

the window is increased E[5J = 1/A times in one loss cycle on average. 
Accordingly, the probability of idle mode is 



2 -^Q(2D-W/a) 



Moreover, 



E\^^e(2aD-W)] /X 
P(TCP in IDLE mode) - L 11 1 



1/A + E[R(W)} + E [*^w®(2aD - W)] /X 

(2.50) 

The probabilities of CA and FR/FR modes in Eqs. f liOil) and (12T451) should 
be modified proportionately. As a result we obtain 

^ F w (2aD) + ±E[W m ] + 2aDE [±Q{2aD - W)] 

(2.51) 

for the congestion window distribution, where I used that E[Q(2aD — W)} = 
J 2aD fw(w) dw = 1 — Fw(2aD). The truncated expectation of 1/W can 
be obtained similarly to (12.351) with r = — ^tj, but one should include the 
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incomplete Gamma function T(z,x) = f^°x z 1 e x dx as well: 



E 



W 
1 

w 



Q(2aD-W) 
X 



2aD 



a (m + 1) 



w 



-f w {w) dw 



fc=0 



m 2XDc k \ . 
"> , ; ) c™+ih k {c). 



m + 1 m+1 



(2.52) 



The moments of W* can be given easily: 



E[jy* fc ] 



E[W k ] + ±p k E[W m+k ] + E [(2aD W^ 1 - W k ) Q{2aD - W 



F w (2aD) + l^[W m ] + 2aD E [±Q(2aD - W)] 

(2.53) 

The distribution and moments of the ideal WAN scenario can be obtained 
from (12.511) and (12.531) in the aD — > oo limit: 



lim 

aD— >cx 



fw*{w) 



/h/(w) and 



lim 

aD— >o 



E[W* fc ] 



fe-n 



(2.54) 



Note that the formula (1 2 . 5 1 f) at D = reduces to (12.461) . derived earlier 
for an ideal LAN scenario with CA and FR/FR modes. Note further that 
(12.511) is applicable not only for the ideal WAN or LAN scenarios, but also 
for for the most generic configuration. Moreover, I would like to emphasize 
that the parameters of the model can be obtained from the intrinsic "ACK 
time" dynamics of TCP, and no parameter fitting is necessary. Specifically, 
for TCP/Reno the parameters are m=l, X/a = p, [3 = 1/2 and 2aD is the 
bandwidth-delay product measured in packet units. 

In order to verify (12.511) I carried out simulations. The link parameter 
2aD has been set to 170 packets and the packet loss has been varied in the 
range of p = 10~ 4 — 5 ■ 1CT 2 . Simulation results are shown in Fig. 12.71 I have 
plotted the contribution of the active periods to the theoretical distribution 
with dotted lines for comparison. A transient between the ideal LAN and 
WAN network configuration can be observed at p = 10~ 4 , since the parameter 
2aD = 170 falls in the bulk of the distribution. An excellent fit can be seen at 
small loss probabilities and a small discrepancy can be detected in the mid- 
range 10~ 3 < p < 10~ 2 . For probabilities p > 0.01 the neglected slow start 
mechanism becomes more and more significant. As a result the theoretical 
distribution deviates from the measured histogram more markedly. 
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Figure 2.7: Histograms and theoretical distributions of congestion windows 
in WAN. The bandwidth-delay product is 2DC/P = 170.67, measured in 
packets. Note that this value falls in the bulk of the distribution at loss rate 
p = 10~ 4 , which means a transient between the ideal LAN and WAN. 
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2.5.3 Dynamics of parallel TCPs 

Now, I am in the position to extend my results for parallel TCPs. Since 
(12.511 ) involves only the intrinsic TCP dynamics, the model parameters m = 
1, A/a = p, and f3 = 1/2 are the same for parallel and a single TCP. 
The propagation delay parameter 2aD might change, however, because the 
interaction of different TCPs might alter the idle periods experienced by the 
individual TCPs. 

The parallel TCPs operate in WAN environment until the number of 
packets in the network, that is the sum of the congestion windows Yli=i W* , 
is less than 2aD. Let us consider one of the parallel TCPs and denote its 
congestion window by W*. The length of the idle periods felt by the selected 
TCP is 2D—W* I a in the WAN case and the propagation delay is independent 
of the states of the different TCPs. The congestion window distribution of 



each individual TCP can therefore be given by ( 12.511 ). In Fig. 2.8(a) I show 
the congestion window histogram of one out of two parallel TCPs. The link 
delay is D = 2s, large enough to leave the buffer empty. As a comparison 
I also show the histogram of a single TCP and the theoretical distribution 
function for the same network configuration. It is apparent that the two 
histograms are almost identical and the discrepancy between the theoretical 
distribution and the measured histogram remains almost the same for parallel 
TCPs as for a single one. 

In the LAN scenario, when the sum of the congestion windows is larger 
than 2aD, a queue starts forming in the buffer and the buffering delay be- 
comes significant. The selected TCP suffers Yli=i W* / a — W*/ a long idle pe- 
riods, caused by intermediate packets sent by the rest of the TCPs. Thereby, 
the dynamics of the TCPs becomes coupled and they cannot be handled as 
being independent any more. 

In an attempt to solve this problem I am going to use the mean field 
theory, that is I suppose that TCPs are independent and they feel only the 
average influence of other TCPs. For a large number of TCPs the sum of con- 
gestion windows will fluctuate around its average 



and the deviation from this average will be of order ~ v N. For sufficiently 
large N the relative size of fluctuations will decay as ~ 1/y/N. Therefore, 
for large N it is reasonable to replace the sum of congestion windows with 
its average. In this approximation each TCP operates in a WAN-like envi- 
ronment, since they feel a constant delay as in WAN. So we can apply the 
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(a) WAN with D = 2s (2aD = 85.33). (b) LAN with D = 0s (NE[W*} « 36.55). 

Figure 2.8: Illustration for the mean field approximation for two parallel 
TCPs. The congestion window histogram of one of two parallel TCPs is 
shown. The histogram of a single TCP is displayed for comparison. Network 
parameters are C = 256 kb/s, P = 1500 byte and p — 5 ■ 10~ 3 . 



corresponding results of WAN. We simply have to replace all occurrences of 
2aD in (I2.5ip with iVE[W*], the mean field approximation of the sum of the 
congestion windows. 

The self-consistent mean field solution for E[W*] can be obtained from 
(12.531) . The occurrences of 2aD have to be replaced with iVE[W*] again, and 
the fixed point solution for iVE[W*] should be found. The simplest method 
for finding the fixed point solution is to iterate (12.5311 : start with a good 
estimate of the mean field solution, calculate the next estimate with the 
equation and replace the new value to the right hand side of the equation. 
This process should be repeated until the desired precision is achieved. A 
good initial value for the iteration is the mean congestion window in the 
aD — > oo limit (12.541) , because many parallel TCPs (N ^> 1) are close to the 
ideal WAN scenario. 



In Fig. 2.8(b) the congestion window histogram of one out of two parallel 
TCPs is presented in a LAN environment, when the link delay is D = 0s. 
The mean field approximation of the distribution function shows an excellent 
fit. The histogram of a single TCP in the same network configuration is also 
plotted with the corresponding theoretical distribution. The two histograms 
are rather different but both theoretical distributions are close to the corre- 
sponding empirical values. Fig. 12.91 shows a similar LAN scenario with one 
of 20 parallel TCPs. 
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Figure 2.9: Congestion window histogram of one of 20 parallel TCPs. The 
histogram of a single TCP is displayed for comparison. Network parameters 
are C = 256 kb/s, P = 1500 byte and p = 10 -3 . The mean field solution of 
(12331) is NE[W*] ss 827.75, close to an ideal WAN scenario. 

2.6 Conclusions 

In this chapter I analyzed the congestion window distributions of TCP in 
a standalone, infinite-buffer network model. I derived new analytical for- 
mulas for the distribution of generic congestion window values, which take 
into consideration not only the congestion avoidance mode, but also the fast 
retransmit/fast recovery modes of TCP. My novel approach for modeling 
WAN configuration made it possible to describe TCP traffic with all model 
parameters at hand; no parameter fitting is necessary. Moreover, I presented 
analytic calculations not only for ideal LAN and WAN scenarios, but also 
for intermediate network configurations, where the queuing and link delays 
are comparable. The mean field theory has been applied for parallel TCP 
traffic. My analytic calculations were verified against direct simulations. The 
analytic results fit the histograms I received from the simulations when the 
packet loss probability is small as well. Discrepancies between the analytic 
results and simulations become stronger when the packet loss probability in- 
creases, however. The differences mostly come from the neglected slow start 
mode of TCP and the fluid approximation of the discrete time congestion 
window process. The main virtue of my work is that it provides an analytic 
description of TCP traffic in more detail than previous works, without the 
need to adjust parameters empirically. 




Chapter 3 

Traffic dynamics in finite buffer 



In the previous chapter I assumed that the common buffer under investigation 
was not a bottleneck buffer. The model describes the dynamics of TCP in 
the presence of external packet loss quite accurately. However, packet loss 
in current networks is generated predominantly by overloaded buffers. This 
is an inherent property of TCP congestion control mechanism since TCP 
increases its packet sending rate until packet loss occurs in one of the buffers 
along the route between the source and the destination. In the literature 
little or no progress has been made towards an understanding of the detailed 
mechanism of packet loss in IP networks. 

In this chapter I give a detailed mathematical description of the packet 
loss mechanism. In Section l3Tl the refined network model is defined. I inves- 
tigate the dynamics of TCP in the presence of a finite buffer in Section 13.21 
I discuss my model in Section 13.31 where I will derive analytic formulas for 
the packet loss and the congestion window distribution. The new formulas 
and distributions are validated by direct simulation. Finally, I conclude this 
chapter in Section l3Tl 

3.1 The finite buffer model 

My extended network model is very similar to the model I studied in the 
preceding chapter with the decisive difference that the buffer size B is finite 
now (Fig. 13 . If ) . The remaining part of the network is — as in the previous 
chapter — modeled by a fixed delay D, constant bandwidth or link capacity 
C and random loss probability of p per packet. In my idealized network 
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Buffer 



Sink 



Link delay, D 

Packet size, P_ 

Loss rate, p 
Bandwidth, C 



Figure 3.1: The finite buffer model. In numerical simulations packet size 
P = 1500 byte and bandwidth C = 256 kbj s have been fixed, and the buffer 
size B and the packet loss probability p have been changed. 




model one TCP injects packets into the buffer. 

The buffer is large enough such that TCP can operate in congestion avoid- 
ance mode, but it is finite, so that packet loss can occur in it. Furthermore, 
the number of packets getting lost in the buffer is comparable with the full 
packet loss, including the corrupted and lost packets in the rest of the net- 
work. 

We can estimate the parameter range where the finiteness of the buffer 
plays an important role in a LAN scenario, when the link delay is negligible. 
The finite buffer size limits the total congestion window achievable by TCP 
to «)„„ ~ B. On the other hand, I have shown in the last chapter that 
external packet loss in the core network would set the average congestion 
window to (w) ~ cj \fp, where c ~ 1.5269. If cJ.Jp ~ B, that is pB 2 « c 
holds then the external and the buffer loss play comparable role. A more 
detailed analysis will be given in Section 13.3. 1L 



3.2 Dynamics of a single TCP 

In this section I present the analysis of a single TCP operating in my finite 
buffer model. I start off by the fluid equation 

— = — (3D 

dt R(wy ( ' 

similar to Eq. (|2.12l) of the infinite buffer model, but with the important 
difference that the maximum congestion window is limited: w G [0, B], where 
B = B + 2DC/P. The round-trip time is supposed to be the same as ( 12.131 ): 



R(W) = a^W m , 



(3.2) 
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1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 
t 

(a) ns-2 simulation of LAN 




(b) Ideal fluid model of WAN 



Figure 3.2: The congestion avoidance process of TCP/Reno in the case of 
finite buffer, obtained form ns-2 simulations . The idealized fluid approxima- 
tion of the congestion window is also shown for comparison. 



where a > and m > 0. The above equations can be solved the same 
way as the equations of the infinite buffer scenario and we obtain the time 
development of the congestion window between losses: 

W m+X {r) = VF m+1 (r*) + a (m + 1) (r - n) , (3.3) 

where T{ denotes the instant of the i th packet loss as before. 

An idealized congestion window process with m = 1 can be seen in Fig- 
3.2(b) f or a LAN network. In order to validate my model I implemented 



ure 



it in ns-2. Simulation results of the congestion window process can be seen 



in Figure 3.2(a) for comparison, with C = 25Qkb/s, P = 1500byte, B = 50, 
D = 0s and p = 0.0008 parameter values (pB 2 = 2). Equation (13.31) gives 
a reasonably good description of the window development. The effect of 
discrepancies will be discussed in Section 13.3.21 

Furthermore, 5i, the elapsed time between consecutive packet losses oc- 
curring at the external link, are supposed to be independent, exponentially 
distributed random variables with mean 1/A and probability distribution 

f Si (x) = Aexp(-Ax), VzGN. (3.4) 



Note the important memoryless property of the exponential distribution. It 
means that if a certain length of time has elapsed since a packet loss then the 
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probability distribution of the time interval remaining until the next packet 
loss is still given by (13 .41) regardless of the elapsed time. 

I now derive the formula that connects consecutive window values before 
losses. Let us denote by W; = W{t~) the window value immediately before 
the i th loss event and its distribution by fwi(w). If the value of the ran- 
dom variable Si is small enough, the next Wi+\ can be obtained from (13.31) . 
However, if Si is so large that the window would grow above the upper limit 
B then packets will be dropped at the buffer, and the Wi+i will be set to 
B. Accordingly, a mapping can be given that connects the consecutive Wi 
values: 

(cWr +1 + a(m+l)Si)^ ^ , 

b if^H-ffr 1 - 

1 — a(m+l) 

(3.5) 

In this manner, the time elapsed until the next packet loss might be smaller 
than Si if Si is too large. Due to the property of the distribution (13.41) noted 
above, at the succeeding application of (I3.5P the next Si + % time interval can 
be drawn from distribution (13.41) again. 

The next "before loss" window distribution fw i+ i( w ) can now be calcu- 
lated by the Perron - Frobenius operator, C, of the mapping: 

B oo 

f Wi+1 {w) = CT Si (W i+1 ) = J j S(w- T x (w')) f Wi (w') f Si (x) dxdw' (3.6) 



where S() is the Dirac-delta distribution and I averaged over the distribution 
(13.41) . After substituting ( 13 M into ( 13.61) we have to consider the condition 

W m+1 —cW m+1 

< Si = i+ ^, (m+1 )' — • This provides us with Wi < W i+ i//3 which should be 
taken into account in the upper boundary of the first integral. The integration 
in x can be carried out: 



[B,w/p) 

fw i+1 (w) = —w m e~ Sf^+i) j fwAw') e'^+i) dw' 



mm 



a 



o 



(3.7) 

B 

+ S(w — B) e «("•+!) / f w . (w ) e dw . 

o 
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Let Wbx = Hindoo W n denote the stationary limit of the "before loss" 
window sequence. Its stationary distribution, fw hl (w), is the fixed point 
solution of (13.70 . For finding the fixed point solution observe that for any 
probability distribution fwi( w ) the transformed one, fw i+1 (w), will contain 
a Dirac-delta term S(w — B) because of the second term of (13 . Tj) . I therefore 
use the following ansatz for the stationary distribution 

f Wh l H = A(X/a, B) 5(w -B) + <j>(w), (3.8) 

where 4> : [0,B] — ► K is a continuous regular function and A(X/a, B) is a 
constant. The delta function represents those points where the packet loss 
occurs in the buffer and the value of the pre-loss window is B. The constant 
A(\/a,B) represents the probability that a packet gets lost in the buffer, 
and it might depend on the external loss A/a and buffer size B. I am going 
to present the detailed interpretation of A(X/a, B) in Subsection 13.3.11 

Applying the probe function (13.81) in (13 .7p and separating the regular and 
5{w — B) terms we obtain 

B 

A(X/a,B) = e Q < m + 1 ) <j>(w ) e «( m+1 > dw >' + A(A/a, B) e , (3.9) 

o 

min(s,iu//3) 
\ , m+i r , im+l 

/\ Am ^_ j Xcw 

<p(w) = — w m e a ( m + 1 ) / (fr(w ) e dw 
a J 
o 

\ A( tu m+1 - c S m + 1 ) 

+ A(\/a,B)-w m e Q(w - /3B), (3.10) 

a 

where Q(x) is the Heaviside step function. Notice that for w e\(3B,B] the 
upper limit of the first integral is independent of w and the Heaviside function 
equals 1. The functional form of the unknown function <p(w) on this interval 
can therefore be resolved. Only the value of the definite integral — which is 
a constant — should be determined. If we look for a solution on the adjacent 
interval ](5 2 B 1 (3B] we can see that the upper bound of the first integral falls 
in the range ](3B,B], where the functional form of the unknown function 
was previously found. Again, only the value of a definite integral is to be 
found. Repeating these steps recursively one can see that the solution for 
the integral equation (13.101) would be simplified if one looked for the solution 



52 



CHAPTER 3. TRAFFIC DYNAMICS IN FINITE BUFFER 



on disjoint intervals }/3 n+1 B, /3 n B], n G N. Accordingly, let us define the 
following functions: 



<p n (w) = (j){w) X]/9n+lfl, j gn5|(w), 
(3 n B 

S n {&) — / <f>(w)e < m+1 ) dw, 
o 



(3.11) 
(3.12) 



where Xh(w) denotes the indicator function of set H C R, and the constant 



/3 n S 

-^n = S n (—cX/a) = / 0(u>) e Q < m+1 > <iw. 





(3.13) 



By applying the newly introduced definition of Iq in (13.91) we clearly have 



A(X/a,B) 



XB m + x 
g a(m+l) 



A(l~c)_B m + 1 
]^ — g a(m + l) 



(3.14) 



This formula with the above mentioned properties of (I3.10p in the interval 
]{3B,B] provides us with 



(j) (w) 



A w m e «C m + 1 ) 



A(l-c)B m + 1 
^ — g c*(m+l) 



/n. 



(3.15) 



Furthermore, for n G N, n > the recursion 



A 



a 



w/f3 



(w) = - w m e Q ( m+1 > + / 4> n -i(w') e Q ( m+1 > dw' 



/m+1 



(3.16) 



can be derived easily, since the Heaviside function in (|3.10l) is identically 
zero if w G [0, /3J3[. In order to apply the above recursion one should know 
constants /„, which in turn can be obtained from functions S n (s). If we insert 
A3. 10r > into the definition of S n (s) then we get 



So(s) 



1 + as/A 



Sq{sc) + A(X/a, B) E s 



B' 



m+1 



m + 1 



(3.17) 
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where E(x) = e cx — e x , and for all n 6 N, n > 

l / ( r. I A U"B m+1 \ 

= 7^ 7T S n-i(sc) - e-( s+ «)^rri n ^ . (3.18) 

1 + as/ A \ / 

Using (13.14p and (13 . 151) as initial conditions the recursive expressions 
(13.161) and (13.180 can be solved. In order to start the iteration the value of 
the initial condition A(X/a,B), or equivalently Iq = So(—cX/a) is needed. 
In the interest of finding J I calculate the function Sq(s) next. If we suppose 
that Sq(s) is continuous at s = then, using A3. 1T[) . it can be proven by 
induction that 



oo 1 



_ + sc k a/X 

(3.19) 

m+l x ' 



+ A(\/a, B)Ye[ sc k - FT p— , 

where I have used that Hhitv-^oo S (sc N ) = S (0) for c G [0, 1[. Furthermore, 

B 

So(0) — / 4>(w)dw = 1 — A(X/a, B), because /iy bl (w) is normalized, 
o 

The function Sq(s) is bounded because it is defined via the definite inte- 
gral of the regular function <p(w). The pole at s = —A/a on the right hand 
side of ( I3.17p must therefore be canceled by the subsequent factor: 

(\ p> m+l \ 
--——[ ) = °- (3-20) 

In addition, So(— cX/a) can be obtained from (13. 19h . As a result, 

A B rn+1 



_ c k+i 



(\ JDm+l \ , \ 1 

-a— i h^-^.^n— 
/ k=0 



fc=o \ / 1=0 



is acquired. We can express A(X/a,B) now as 



A(X/a,B) = l — (3.22) 

1-icG 



A B m + 1 
a m+l 
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where L(c) has been defined earlier in (12.291) . and 



oo k _ 

G{x) = Y J E{-c k x)\{ Y - d (3.23) 

k=0 1=1 

with the convention that the empty product equals 1. Note that in (13.221) 
the parameters appear only in the XB m+1 /a combination. This expression is 
the control parameter in my model. Systems in which external packet losses 
and buffer sizes differ, but the \B m+1 /a product is the same, are similar in 
the sense that they can be described with the same constant A[\B m+l /a). 



3.3 Discussion 



3.3.1 The interpretation of A(-) and the effective loss 

Now I present a brief explanation of the meaning of A(\B m+1 /a) and high- 
light its importance. First I calculate the average time elapsed between two 
packet-loss events. Remember that the inter-loss times on the link Si are 
IID random variables with exponential distribution. However, the buffer can 
induce extra packet losses. If the congestion window was W^x at the previ- 
ous packet loss then the maximum inter-loss time is clearly — — ^^+1)' ' 

at which time the buffer becomes congested. The exponential distribution 
of Si is truncated above this upper limit, and the probability that Si exceeds 
this limit is concentrated at the maximum inter-loss time. Consequently, 
the conditional probability distribution that a packet gets lost at either the 
buffer or the link after S' time, supposing that the value of the congestion 
window was W-^x at the previous packet loss, can be written as 



f S '\w hX {x,w) = S \x 



a, (m + 1) 



m + l 



+ Ae 



-Xx 



i-e \x- 



a (m + 1) 



(3.24) 
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With the help of the total probability theorem and ( 13.241) the average inter- 
loss time can be given by 

oo oo 

E[6') = J J xf S '\w hX {x,w)f Whl (w)dxdw 



oo 

= /AV I bL ^ 

o 

1- A(XB m+1 /a) 
~ A ' 

where (I3.9P has been used for replacing the last integral. 

The meaning of this simple expression becomes clearer if we recognize that 
A' = 1/E[5'] is the total packet loss rate — link and buffer losses combined. 
Therefore, 

± = l-A{\B m+1 /a), (3.26) 

which can be interpreted as the ratio of the number of packets that are lost 
at the link and the total amount of lost packets. Similarly, A(\B m+1 /a) 
is the ratio of the number of packets that are lost at the buffer Abuffcr and 
the total packet loss N tota \. The possibility that this ratio can be estimated 
from my model is the main result of this section. This interpretation and the 
exact knowledge of the form of A{\B m+l / a) allows us to treat buffer-losses 
as if they were link-losses. It also makes it possible to calculate the total loss 
along a multi-buffer, multi-link route. 

According to (13.261) the measured 1 — A/ A' expression should be equal to 
A{\B m+1 / a) and it should not depend on A and B separately, but only on 
the \B m+1 /a product. In order to verify (13.261) I carried out a number of 
simulations with different A and B parameter values in the 1 < \B m+1 /a < 
10 parameter range for both LAN and WAN network configurations. 

The parameter settings of the present model are the same as those of the 
infinite-buffer model in the previous chapter. In particular, for LAN scenar- 
ios: m = 1, ft = 1/2 and A/a = p. Setting the value of the new parameter B 
requires extra care, however. When comparing simulations and the formula 
(13.261 ) we have to take into account that in reality the system can store more 
packets than the actual buffer size. For example, the receiver is processing 
one packet, and even if the link delay is zero, one acknowledgment packet 
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Table 3.1: Fitted b parameter values for different buffer sizes B. The average 
value is b = 2.5354. 



B 


b 


30 


2.5045 


40 


2.5790 


50 


2.6798 


60 


2.4997 


70 


2.4143 



is traversing back to the sender during the file transfer, increasing the max- 
imum number of unacknowledged packets in the system by two. Moreover, 
TCP detects packet loss one RTT later than it actually happens, causing 
overshoot of the maximum window. The difference between simulation and 
fluid approximation can also cause some discrepancy. In other words, TCP 
behaves as if the buffer would be bigger than it really is. The effect of this 
behavior can be observed in Fig. 3.2(a) where the congestion window occa- 
sionally exceeds the buffer size B. 

In order to treat this problem I assumed that we have to set the congestion 
window limit to B = B + b Ll where b L has been fitted for different buffer 
sizes B. The fitted values of bz can be found in Table 13.11 It can be seen 
that b L is constant and practically independent of B. Based on simulation 
results I set &l to its average value b^ = 2.5354. 

Simulation results are shown in Fig. 13.31 where I compare the theoretical 

formula for A [pB 2 ^ and the ratio Nb n s eT /N t ota,i measured by ns-2. Abuser 
and A total are the number of packets dropped at the buffer and the total 
number of dropped packets, respectively. In the simulated parameter range 
I obtained an almost perfect match. 

Now I turn to the WAN scenario. Notice that the time could have been re- 
placed with "ACK time" in the previous arguments concerning A[\B m+l / 'on 
and the effective loss. In addition idle periods affect neither the number of 
packets dropped at the buffer nor the number of packets lost at the link. 
Therefore, A(\B m+l /a) is basically related to the intrinsic "ACK time" dy- 
namics of TCP. Consequently, the ratio N^^ eT /N) total in a WAN scenario 
should be equal to A{\B m+1 / a) with the intrinsic parameters of TCP dy- 
namics m — 1, /3 = 1/2 and X/a = p. 



3.3. DISCUSSION 



57 




1 2 3 4 5 6 

P B 2 /2 



Figure 3.3: Comparison of the theoretical function A (p(B + &l) 2 ) and the 
measured ratio iVbuffer/-Wtotai obtained from numerical simulations in various 
LAN configurations. The control parameter pB 2 /2 has been varied in the 
range of 1 and 6, at various buffer sizes between B = 30 and 70. 



In an ideal WAN network the buffer size would be zero. However, in 
reality the buffer size B must be set to a positive number, otherwise packet 
bursts cannot go through the buffer and TCP shows pathological behavior. 
If the size of the buffer is smaller than the maximum value of the slow start 
threshold then the slow start mechanism can have a serious impact on the 
number of packets lost at the buffer. Indeed, sudden bursts of packets of 
the slow start mode might cause further congestions at the buffer, which, in 
turn, might induce another slow start. This cascade of slow starts lasts until 
the slow start threshold is reduced below the size of the buffer. 

In order to demonstrate this phenomenon I carried out simulations with 
such a parameter setting that 2DC/P = 60. The buffer size was B = 3, 10, 
and 30. Simulation results are shown in Fig 13.41 where I compared the 
theoretical formula A[\B m+1 /a) and the measured loss ratio iVbuffer/iVtotai- 
Data points deviate from the theoretical curve considerably when B = 3. 
Deviation from the theory is less for B = 10 than for B = 3, but it is still 
significant for larger values of the control parameter. Finally, the measured 
data points fit A(\B m+l /a) almost perfectly when B = 30. At the end of 
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Figure 3.4: The ratio N^ u s ei -/N tota \, obtained from ns-2 simulations, is plotted 
as the function of pB 2 /2. The theoretical function A{x) is shown for com- 
parison. Below B « aD the buffer cannot handle packet bursts produced by 
the slow start algorithm, therefore excess packet drops appear at the buffer. 
Dotted lines connecting data points at B = 3 and 10 are guides to the eye. 

this subsection I estimate the effective loss A' in the XB m+1 /a — ► oo and 
XB m+1 /a — > limits. The first is the infinite buffer case, when packets get 
lost only on the link. It is evident from (13.231) that in the x — > oo limit 

the — e x term dominates G(x). Therefore, A(X/a,B) ps e" m + 1 /L(c) if 
\B m+1 /a 3> 1, which implies that 

(A B m + 1 \ 
p a m+1 \ 
l + (3.27) 

The fraction of packets dropped at the buffer decreases at an exponential 
rate as the control parameter \B m+1 /a increases. 

The second case is the "extreme" bottleneck buffer limit, when packets 
only get lost in the buffer. From (13.261) and (13 .221) it follows that 

A' = A I 1 \ r- I . (3.28) 
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With the series expansion of L(c)G(x), derived in Appendix IA.ll we can 
write 



V 1 - c A B m+l 2 V" 



(3.29) 



1 - c B m+1 2 V" 
when X 2 B m+1 /a <C 1. In particular, in an ideal LAN scenario 



p' = ^I + Ip + °(p 2 B 2 ) (3-30) 



holds for the effective packet loss probability p' = X'/a in the p 2 B 2 <C 1 
limit. The first order approximation of this formula has been calculated in 



41l ] for the same bottleneck scenario. This is a further indication that my 
calculation is correct. Since I obtained (13.301) as a limit of my model, my 
work can be viewed as a generalization of previous studies. 



3.3.2 Histograms and probability distributions 

I continue in this section with the derivation of the congestion window distri- 
bution from Eqs. (13.14l) - (13.18l) . It is easy to see that the piecewise solution 
of (13.161) on the disjoint intervals can be written in the form 



<f> n (w) = -w m J2h nik e-^ wm+1 . (3.31) 

fc=0 



Note that functional form of (13 .3 II) is the same as (12.331) in the infinite buffer 
scenario. Substituting ( 13.311) into ( 13.161) we can derive recursive formulas 
for the constants h n ^ = h n ,k(X/a, B). The constants might depend on the 
parameters A/a and B as I denoted explicitly. After the substitution we 
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acquire 




(3.32) 



It can be seen that after the recursive step in (13.321) only the required const x 

^ 

e ~a^+i w type terms appear. Comparing the coefficients on both sides 
term by term we receive the following equations: 



ft , = A(X/a, B) ei«r (3.33) 

K»,o = /.„ + ± J&f"Wm- (3.34) 

k=0 

h h ^ 1 

h n+i,k = - r^zfei = n ~fc +1 '° II l _ 1 = L (°) h ki c ) K-k+1,0, (3.35) 



where h k (c) and L(c) are defined in (12.290 and (12.301) . In order to complete 
the system of recursive equations we have to provide constants /„. The 
constant Iq can be obtained from ( 13 .14[) : 



A s m + 1 A cfl m+1 ' 

I = A(X/a,B) ( e -^+^ - e«^+^~ ) , (3.36) 
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while for n G N (13.131) can be applied: 



n B 

f A em m + 1 

I n+ i=I n - / n (w)e« ™+i dw 



,8*1+1 £ 



m+1 / , \ \ sm+1 \ n 



n.k 



C k — C 



g V J a m+1 — g V ) ol m + 1 

k=0 

= I n -± E (c- (c- k - c) -^^) (3-37) 

Although the number of the coefficients is infinite, we can use the first 
few in practice. Since the smallest congestion window value is 1, no more 
than log 2 B number of 4> n (w) functions are relevant and the inequality k < n 
implies that k is also limited. Furthermore, it is obvious from (13.351) that 
for every n the absolute value of h n ^ decays very quickly as k increases, so 
h n ,k ~ can be supposed if k > 3. 

So far I calculated analytically the distribution of the "before loss" values 
of the congestion window. In practice the distribution of the congestion win- 
dow at an arbitrary moment is relevant. I calculate this distribution fw( w ) 
here. We can basically repeat the same arguments as in Sec. I3.2L In gen- 
eral, between losses, the congestion window is developing according to (13.31) . 
where r is a uniformly distributed random variable on the random inter- 
val [0,p\. The conditional distribution of r — supposing that p is given — is 
f T (t | p = x) = ±Xlo,x]{t)- The distribution of p is f p (x) = g^jA(ac), simi- 
larly to the infinite buffer scenario. Thus, the distribution of the congestion 
window at an arbitrary moment can be given by the following transformation 

B oo oo 

fw(w) = / / 5(w - T t (w')) f T (t\p = x) f p {x) fw hA .( w ') dtdxdw' 





B oo x 

1 



E[8i] 




5(w - T t (w')) fsi(x) fw h A w> ) dtdxdw', (3.3£ 



ooo 



where T T (W hX ) = [W hA m+1 + a (m + 1) r] m+1 is the forward mapping of the 
congestion window from W^a. to W, r time later. The integration in variable 
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t can be carried out: 

/^- r ^,*-^o(,-==5z2p)[i-e(^-=) 



(3.39) 

The w m /a term is from the inverse- Jacobi of T t (w'), and the Heaviside func- 
tions correspond to the range of integration t = ^'"^"^ — G [0, x\. There- 
fore, ( 13.381) can be written as follows: 

min(B,w/b) oo 

fw(w) = ^-w m [ [ Xe~ Xx f Wx Aw')dxdw' 



w m+l_ w /rn + l 
01(772+1) 



(3.40) 



min(B,w/b) 
m+i r \ /m+l 



1- A{X/a,B)a 



w m e <* ™+! / fw hl {w')e a m + 1 dw' 



where I have used (|3.25l) . The implicit definition of fw hl ( w ) given in (13.7ft 
and (13. 8ft yields that 

It can be seen that the final distribution is proportional to the regular 
part of the "before loss" distribution, so I can apply my earlier results given 
in (13.331) - (13.371) again. There is no Dirac-delta distribution in (13.41ft . 

In the B — ► oo limit the derived formula (13 .41 j) should converge to (12.331) . 
the distribution derived for the infinite buffer scenario in the previous chap- 
ter. Let me confirm that my result is consistent with the infinite buffer case. 

I showed before that A(\/a, B) fa e~«^/L(c) if \B m+1 /a > I. Therefore, 
A(X/a, B) — > if B — > oo, which means that fw( w ) — li m n^oo <Pn(w). Fur- 
thermore, (13.361) and (13.371) imply that linig^^ I n = 1/L(c) for all nGN. Us- 
ing these results one can see from (13.331) - (13.35ft that lirn^oo lim^^ h n ^ = 
h k (c) for all k G N. 



Local Area Networks 



In order to verify my results I carried out simulations with ns-2. I have 
applied my results for both LAN and WAN networks. Let us consider the 



3.3. DISCUSSION 



63 



0.045 r 

0.040 
0.035 
0.030 
0.025 
0.020 
0.015 
0.010 
0.005 
■ 



fe 
/.i i 




20 30 40 50 60 70 



0.045 r 
0.040 
0.035 
0.030 
0.025 
0.020 
0.015 
0.010 
0.005 




IP 

fW3 




10 20 30 40 50 60 70 



(a) pB 2 /2 = 



(b) pB 2 /2 = 0.5 




10 20 30 40 50 60 70 



(c) pB 2 /2 = 1 



(d) pB 2 /2 = 2 




(e) pB 2 /2 = 3.5 



(f) P B 2 /2 = 5 



Figure 3.5: Comparison of simulation results and theoretical model at buffer 
size B = 60. fwniw), n 6 N denote piecewise solutions of the congestion 
window distribution f)3.4ip . 
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LAN scenario first, where the model parameters are m — 1, (3 — 1/2, X/a — p 
and the effective buffer size is B = B. I compare the simulation results and 
my model in Fig. 13.51 at pB 2 /2 = 0.0,0.5,1.0,2.0,3.5 and 5.0 parameter 
values and at B = 60 buffer size. The link capacity C = 256kb/s, link delay 
D = 0s and packet size P = 1500byte were fixed in the study of LAN and 
only buffer size B and loss probability p were varied. 

For the interpretation of the simulation results I consider the effect of 
the FR/FR algorithms as well. I showed in the previous chapter that in 
the case of an infinite buffer the effect of FR/FR algorithms can be taken 
into consideration by the modified distribution (12.461) . The finite buffer 
case can be handled similarly, with two minor adjustments. Firstly, the 
packet loss rate A should be replaced by the total loss rate A' = j^a, be- 
cause plateaus of the FR/FR mode appear after packet losses happening 
at the buffer, too. Secondly, the distribution of the "after loss" conges- 
tion window fw al (w) should be calculated directly from W a .i. = /3Wb.i. now: 
fw al {w) = fw hl ((3~ l w)(3~ l . Accordingly, fw{(3~ l w) have to be replaced by 
fw^XP^w) = A(\/a,B)S(p- 1 w-B)+(l)(p- 1 w) in (12T46D . After a variable 
transformation in the delta distribution we obtain 

fwH + sg j^S) ^ - gg> + ^r (m+1) ^ m MrM 

fw( w ) - ' a E[W^]~ ' 

a l-A(X,a,B) 

(3.42) 

where (13.411) has been used implicitly. 

The most distinct consequence of FR/FR algorithms is the sharp peak 
in the middle of the histograms in Fig. 13.51 In analytic formula ( 13.421) the 
peak is represented by a Dirac-delta distribution. The delta-distribution has 
been scattered over a finite region in Fig. 13.51 in order to be comparable 
with the peaks in the numerical histograms. The derived analytic expres- 
sion shows very impressive agreement with the numerical simulations. The 
slight discrepancy at larger packet loss probabilities comes from the differ- 
ences between the fluid model and the packet level simulation, discussed in 
Section 12.5.11 

The effect of link delay 

I assumed in the previous analysis that the link delay is zero. My results can 
be applied as approximations for situations where the link delay is non-zero, 
but the probability that the buffer is empty is negligible. In addition to the 
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Quantilcs of cwnd in LAN Quantilcs of cwnd in LAN 



(a) P B 2 /2 = 0.5 (b) P B 2 /2 = 2.5 



Figure 3.6: Quantile-quantile plot of the congestion window in network con- 
figurations with different buffer size B and bandwidth-delay product 2aD, 
but with the same effective buffer size B = B + 2aD = 90. 



buffer, 2aD number of packets and acknowledgments can be found on the 
link where a = C/P. C, D and P are the link capacity, the link delay and 
the packet size respectively. The congestion window limit in this situation 
must be set to the total number of packets B = B + 2aD in the system and 
the link can be treated as a part of the buffer. This can be verified with 
simulations. In my simulation scenario 2aD = 15 and 2aD = 30 number 
of TCP and ACK packets could be on the link. The buffer size was set to 
B = 75 and B = 60 respectively, so that the effective buffer size B = 90 
was the same. In Fig. 13.61 quantile-quantile plots of the congestion window 
are shown. Percentiles of the congestion window are plotted at the given 
link delay and buffer size combinations as the function of the percentiles of 
cwnd in an ideal LAN scenario. Data was obtained from simulations at two 
different control parameter values. It can be seen that data points are close 
to the diagonal, drawn by dotted lines. This implies that the data points are 
from the same distribution when the link delay is zero and when it is small, 
but not zero. Some deviation from the diagonal can only be observed at the 
lower quantiles ofpB 2 /2 = 2.5, when B = 60, aD = 30, because the buffer 
is occasionally empty in this case. 
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Figure 3.7: Comparison of simulation results and the theoretical model. The 
link could carry maximal 2aD = 60 number of TCP and ACK packets, and 
the buffer could store B = 3 packets, fw^w), i G N denote the piecewise 
solutions of the congestion window distribution. 
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Figure 3.8: Comparison of simulation results and the theoretical model. The 
link could carry maximal 2aD = 60 number of TCP and ACK packets, and 
the buffer could store B = 30 packets. fwi( w ), «£N denote the piecewise 
solutions of the congestion window distribution. 
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Wide Area Networks 

In a WAN scenario buffering delay is small compared to the link delay. As 
I noted in the last section, however, it cannot be set to zero, because in a 
packet level simulator packet bursts appear inevitably. Accordingly, the link 
delay was so large that it could carry 2aD = 60 TCP and ACK packets 
simultaneously. Furthermore, two buffer size values B = 3 and 30 were 
selected for numerical simulations. The analytical formula for the congestion 
window distribution can be obtained from the modification of (12.511) for the 
FR/FR algorithms, analogously to the LAN scenario. Other parameters of 
the WAN model are m = 1, A/a, and (3 = 1/2. 

The theoretical distributions and histograms obtained from ns-2 simula- 
tions can be seen in Fig. 13.71 at B — 3, which is close to the ideal WAN 
scenario. The external loss rate was varied in the < pB 2 /2 < 5 range. All 
other parameters were fixed. One can see that the histogram deviates from 
the theoretical distribution even for small values of the control parameter. 
The non-zero probability in the histogram that the congestion window is 1 
implies that the slow start mechanism is responsible for the discrepancy. In 
Fig. 13.81 empirical histograms are compared with the theoretical distribution 
at B = 30, which is an intermediate configuration between LAN and WAN. 
The effect of slow start mode is much less significant than at B = 3. 

The main source of error is the macroscopic probability of slow start 
mode. The other observable difference from experiments comes from the 
slight discrepancy in the position of the Dirac-delta and the finite peak in 
the histogram. Despite these errors my model agrees with simulations for 
small loss probabilities and gives a qualitatively correct description of the 
WAN situation for larger ones. 

3.4 Conclusions 

In this chapter I investigated the TCP congestion avoidance algorithm in 
networks where the finite buffer size limits the maximal achievable congestion 
window size. The most important development I accomplished in this study is 
that the total loss felt by TCP, including the buffer and the external packet 
loss, can be predicted from the network parameters, namely the length of 
the buffer and the probability of external packet loss. This formula makes it 
possible to calculate the total loss along a multi-buffer, multi-link route. 
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The presented analytical expression, A(x), can be computed numerically 
without difficulty and the total loss can be calculated by a simple formula. 
I also showed that A(x) and the coefficients which appear in the probability 
distributions depend only on a certain combination of the parameters. This 
combination is the control parameter in my model. Networks with the same 
control parameters are equivalent in the sense that the same portion of the 
total packet loss occurs at the buffer, and the coefficients are the same in the 
distribution function. 

In addition, I derived the stationary probability distribution of the conges- 
tion window process analytically in LAN, in WAN, and in general situations. 
New types of congestion window distributions are discovered when the packet 
loss in the buffer is large compared to other sources of packet loss. These are 
different from the usual Gaussian-type single humped distributions and my 
findings can help to develop a qualitative classification of window distribu- 
tions. I validated my calculations with computer simulations and I showed 
that my analysis agrees with the simulations properly. I also pointed out the 
limits of my model. More specifically, I demonstrated that the effect of the 
slow start mechanism becomes significant if the buffer size is small or the 
packet loss probability is large. 
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Chapter 4 

Traffic dynamics on complex 
networks 

The focus of the previous chapters was on TCP dynamics. The model of 
network topology was very basic, consisting merely of one buffer and one link. 
All details of the network topology were concentrated into a few parameters 
of the link, namely the link delay, bandwidth and packet loss probability. 
These effective parameters could be tuned freely in the model. However, 
we do not know yet how these parameters should be adjusted in a complex 
network of thousands of nodes. 

Since finite buffers naturally induce packet losses a long TCP session 
eventually achieves an equilibrium at a certain loss probability. For a fixed 
network configuration and a system of TCP connections, therefore, packet 
loss probabilities are determined by the steady state of network traffic. The 
steady state of the system is heavily influenced by the allocation of the net- 
work resources, especially the link capacity. 

In this chapter I study what the optimum distribution of link capacity 
is in certain types of evolving networks when the local structure of the net- 
work is known. The motivation behind this problem is that the Internet is 
basically being developed locally. In my model I suppose that optimum link 
capacity is proportional to the mean traffic demand of the particular link. In 
a homogeneous network the average traffic demand, in turn, is proportional 
to the expected number of flows that utilize a particular link. Since rout- 
ing of packets in computer networks can be supposed to be via the shortest 
path between end nodes it follows that the distribution of shortest paths is 
a matter of importance. 
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The main subject of my investigation is the " betweenness" of links, which 
is to say the number of shortest paths that pass over a link. Note that 
edge betweenness is essential not only in the case of the Internet, but in 
other complex networks too. For instance, edge betweenness can measure 
the "importance" of relationships in social networks or the probability of 
discovering an edge during a network survey. Until recently, however, less 
attention has been paid to edge betweenness. 

The probability distribution of edge betweenness gives a rough statistical 
description of links and it characterizes the network as a whole. Therefore, it 
is an important tool for an overall description of links in complex networks. 
However, if the local structure of the network is known — as I suppose in 
my model — then the probability distribution of edge betweenness under the 
condition of the local property provides a much finer description of links than 
the total distribution. Therefore, I will aim at the conditional distribution 
of edge betweenness. 

I restrict my model to trees, that is to connected loopless graphs. The 
simplicity of trees allows analytic results for edge betweenness, since the 
shortest paths in trees are unique between any pair of nodes. Although trees 
are special graphs, a number of real networks can be modeled by trees or 
by tree-like graphs with only a negligible number of shortcuts. Important 
examples of such networks are the ASs in the Internet 

As a model of evolving scale-free trees I consider the Barabasi-Albert 



(BA) model extended with initial attractiveness [5J, [55|. The scaling prop- 
erties of the network can be finely tuned with initial attractiveness. Note that 
in the limit of initial attractiveness to infinity the network loses its scale-free 
nature and becomes similar to a classical Erdos-Renyi (ER) network with 
Per = 2/N. Therefore, scale-free and non-scale free networks can be com- 
pared within one model. For the sake of simplicity the infinite limit of initial 
attractiveness is referred to as the "ER limit" hereafter. 

The rest of this chapter is organized as follows. Important results of 
the literature concerning network modeling are collected in Section 14.11 In 
Section 14.21 a short introduction to the construction of BA trees is given. 
Simulations of large scale complex networks are presented in Section 14.31 to 
illustrate the importance of optimum capacity distribution. My results are 
presented in Section I4.4L In particular, a master equation for the joint dis- 
tribution of cluster size and in-degree of a specific edge is derived and solved 
in Section 14.4.11 and Section 14.4.21 respectively. The total joint distribution 
of cluster size is calculated in Section 14.4.31 The marginal and conditional 
distributions of cluster size and in-degree are derived in Section 14.4.41 and 
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Section 14.4.51 respectively. In Section 14.4.61 the conditional distribution of 
edge betweenness follows. Finally, I summarize my work in Section l4~7ol 



4.1 Preliminary results of topology modeling 

In the early 1960's Erdos and Renyi introduced random graphs that served 
as the first mathematical model of complex networks 56]. In their model the 
number of nodes is fixed and connections are established randomly. In one 
variant of the ER model every node pair is connected independently with 
probability Per(N). The probability depends on the size of the network in 
such a way that the average degree of nodes is fixed: (k) = PerN = const. It 
is obvious that the distribution of the degree of any edge is binomial, which 
tends to Poissonian distribution in the N — > oo limit. Several interesting 
properties of the ER model are well understood, including the relative size 
of the giant component, the threshold of connectivity, etc. Although the 
ER model leads to rich theory, it fails to predict the power law distributions 
observed in scale-free networks. 



4.1.1 The Barabasi— Albert model 

Barabasi and Albert proposed a more suitable evolving model of scale-free 
networks [5J7J,|58|]. The BA model is also based on random graph theory, but it 
involves two key principles in addition: a) growth, that is, the size of the net- 
work is increasing during development; and b) preferential attachment, that 
is, new network elements are connected to higher degree nodes with higher 
probability. In the original BA model every new node connects to the core 
network with a fixed number of links m and the probability of attachment is 
proportional to the degree of nodes. The above rules can be translated into 
the following approximating fluid equation, which describes the time evolu- 
tion of the degree of a particular vertex: dki(t)/dt = ki/2t. The solution 
yields fcj(t) = m (t/tj) ' 5 , where t; is the time instant when the ith vertex was 
added to the network. The degree distribution can be given, supposing that 
new nodes are added uniformly in time, by: P [ki(t) < k] = F [ti > m 2 t/k 2 ] = 
l—m 2 t/k 2 (t + mo), where mo is the number of initial vertices. The probabil- 
ity density can be obtained from P(A;) = <9P [ki(t) < k] jdk. The stationary 
solution finally gives 

2m 2 

m = - w . (4.i) 
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The BA model explained successfully the observed scale-free nature of 
many networks by the "rich-gets-richer" phenomenon. However, the model 
was too simple to fit most measured quantities of the real Internet. For 
example, the degree scaling-exponent in (14.11) is 5ba = 3 which is in contrast 
with the exponent 5 = 2.15-2.2 observed in Internet measurements 0, l59| . 
The BA model was later refined by a number of other authors. Dorogovtsev 
and Mendes [60j studied the aging of nodes. The authors extended the BA 
preferential attachment rule so that attachment probability was proportional 
not only to the degree, but also to (t— U)~ v , a power law function of age, where 
v is a tunable parameter. It has been shown analytically and by simulation 
that the scale-free structure of the network disappears if v > 1. Moreover, 
an implicit equation was derived between the scaling exponent of the degree 
distribution and v for — oo < v < 1. The influence of exponentially fast aging 
on global and local clustering, degree-degree correlation and the diameter of 



the network was analyzed by Zhu et al. |6l| . 

A continuum model was developed by Albert and Barabasi 62j for the 
study of the effect of edge rewiring and appearance of new internal edges. In 
the extended model three operations are incorporated: a) m new edges are 
created with probability p, b) m existing edges are rewired with probability 
q; and c) a new node is connected to the network with m new links with 
probability 1 — p — q. In every step a node is chosen randomly first if [aj) 
applies and a random link of this node is removed if El) applies. In case 
of E|) the new node is chosen. Then a new link is established between the 
selected node and another one which selected with the following preferential 
attachment rule: 

The above procedure is repeated m times. 

The authors have observed a transition from a scale-free regime to an 
exponential regime in the (p, q) phase space. The transition takes place 
on the line q t = min [1 — p, (1 — p + m) / (1 + 2m)]. In the scale-free regime, 
where q < q t , the connectivity distribution has a generalized power-law form: 

P(fc) oc (k + A(p, q, m))" 7(?wn) , (4.3) 

where A(p, q, m) = (p — q) 2 ^~^ + 1 +p — q and 7(jo, q, m) = 3 — 2q+ 
In the limit p = q = the model reduces to the scale-free model investigated 



m 



5t| . It can be seen that the scaling exponent 7 changes continuously with 



p, q, and m in the range of 2 to 00. 
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The classic BA model has been extended with initial attractiveness by 



Dorogovtsev et al. [54|. More specifically, the probability that a new node is 
connected to a given site is proportional to A± = A + qi, where A > is called 
the initial attractiveness and is the in-degree of node i. The probability 
distribution of the connectivity, 

P(g) = (1 + a) r[(m + X \ a + 1] . V{q + ma) v (4.4) 
W K J r(ma) r[g + 2+(m+l)a]' V ' 

was derived from a Master-equation approach where a = A/m and m is 
the number of links starting from every new node, as in the BA model. In 
the special case a = 1 the model reproduces the original BA model with 
Ai = ki = qi + m and the solution (14.41) reduces to 

2m (m + 1) , 
P « = *(* + !)(* + 2) - <4 ' 5) 

Compare this result with (14.11) . which comes from a fluid approach. The 
two expressions converge in the k — ► 00 limit, but the constant factors 
are different. For ma + q ^> 1 the expression (14. 4K takes the form F(q) oc 
(q + ma)^^ 2+a \ that is the scaling exponent 7 = 2 + a can be tuned in the 
range of 2 to 00, similarly to the previous model. 

The time evolution of the average connectivity has also been derived. It 
has been found that q(t,ti) oc (tj/f) -1 ^ 1 for t ^> U. The scaling exponent 
of the average connectivity of an old node is therefore (3 = 1/ (1 + a). It 
follows that scaling exponents 7 and (3 satisfy the following scaling relation: 

/3( 7 -l) = l. (4.6) 

The authors have shown that (14.61) is universal, since the above scaling rela- 
tion can be derived in the case of more general conditions. 

Growing; random networks with non-linear attractiveness have been stud- 



ied in [63|, |64j. It has been found that scale- free connectivity distribution can 
be observed only if the attractiveness kernel is asymptotically linear. The 
authors confirmed the above findings indicating that the scaling exponent de- 
pends on the details of the attachment probability and can be tuned in the 
range of 2 and 00. Furthermore, the authors showed that if the attractiveness 
is sub-linear then the connectivity distribution decays at an exponential rate, 
while if the kernel grows more quickly than linearly then almost all nodes 
are connected to a single node. 
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4.1.2 Other network models 

Other mechanisms have been proposed for the formation of scale-free net- 
works. Evans and Saramaki [65fl studied the following simple algorithm: new 
vertices are connected to the end of one or more /-length random walk pro- 
cesses. Several variations for this general algorithm have been considered: 
fixed or variable length random walks, a fixed or random number of con- 
necting edges, different distributions for the starting vertex of the random 
walk process, edge- or vertex-wise restart of random walks, and uniform or 
weighted random walks on the graph. The authors argued that a random 
walk process is a more realistic mechanism than preferential attachment, 
since the random walk uses only the local properties of a network. 

Goh et al. [66] proposed the following stochastic model for the evolu- 
tion of Internet topology: the size of the network increased exponentially, 
N(t) = N(t )e at and the connectivity of each node is changed according to 
the random process 

ki(t + 1) = h(t) [1 + g 0>i + &(*)] , (4-7) 

where go,i are constants and are assumed to be independent white noise 
processes representing fluctuations with mean zero and correlation function 
{£i(t)£j(f)) = o-QjS(t — t')5ij. The authors showed that in a homogeneous 
case, that is when g$ = g^o and o"o = cr^o, the connectivity distribution of 
the network approximately follows a power law with exponent 

7 = l-^ + ^S + 5S, (4.8) 

where g c s ~ go — <7q/2, a^s ~ °o- Links are removed randomly when the 
degree ki decreases and internal edges are created according to a preferen- 
tial attachment rule when the degree ki increases. The model includes an 
adaptation mechanism in which links are only rewired to nodes with larger 
connectivity. The parameters of the model have been fitted to real AS level 
Internet topology. The authors have demonstrated that their model fits the 
degree-degree correlation and clustering coefficient of the real Internet better 
than previous models. 

In a paper by Li et al. [e| the authors argued that the technological 
constraints of router design should be considered as the driving force be- 
hind the development of the Internet. They pointed out that the possible 
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bandwidth-degree combinations are restricted to a technologically feasible 
region for every router. In particular, large bandwidth links are connected to 
low degree routers and as the degree increases router capacity must be frag- 
mented among more and more links. A heuristic degree-preserving rewiring 
algorithm has been proposed by the authors in order to take the above tech- 
nology constraint into consideration: a small number of low degree nodes are 
chosen to serve as core routers first, and other high degree nodes hanging 
from the core routers are selected as access routers next. Finally, the connec- 
tions among gateway routers are adjusted in such a way that their aggregate 
bandwidth to core nodes becomes almost uniform. The resulting Heuristi- 
cally Optimal Topology (HOT) has been compared with other commonly used 
topology generators, e.g. BA preferential attachment network, and general 
random graph model. Performance metrics and random graph-based likeli- 
hood metrics have been defined to compare different topologies, which are 
the realizations of the same degree distribution. It has been shown that the 
overall network performance of the HOT topology surpasses the performance 
of other random networks. At the same time, the "designed" HOT topology 
is very unlikely to be obtained from random graph models, according to the 
defined likelihood metric. The authors concluded that their first-principles 
approach combined with engineered design should replace random topology 
generators in the future. 



4.1.3 Earlier results regarding betweenness 



Node betweenness has been studied recently by Goh et al. [67J who argued 
that it follows power law in scale-free networks, and the exponent 5 « 2.2 
is independent from the exponent of the degree distribution as long as the 
degree exponent is in the range 2 < 7 < 3. The authors analyzed both 
static and evolving networks, directed and undirected graphs as well as a 
real network of collaborators in neuroscience. Their conjecture is based 
on numerical experiments. However, Barthelemy [68| presented counter- 
examples to the universal behavior and demonstrated that the important 
exponent is the scaling exponent of betweenness as the function of connec- 
tivity 1] = (5 — 1) (7 — 1) instead. In a reply [69| the authors argued that 
universality is still valid for a restricted class of tree-like, sparse networks. 

Szabo et al. 7(J used rooted deterministic trees to model scale-free trees. 
The authors have modeled BA networks with a uniform branching process 
in a mean-field approximation. They obtained that the branching process 
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is b(l) = Y^f- on a layer at distance I > 0. The number of nodes n(l) at 
distance I was approximated by a non-normalized Gaussian. It has been 
found that the number of shortest paths going through a node at distance I 
from the root node is L(l) = ^fr, independent of the branching process b(l). 
Finally, the authors showed that node betweenness, which includes shortest 
paths originating to and from nodes in excess of edge load, follows a power- 
law decay with a universal exponent of —2. The same scaling exponent has 
been found experimentally by Goh et al. [67J for scale-free trees. 



A rigorous proof of the heuristic results of [7Jj] has been presented by 
Bollobas and Riordan [7l|. The authors showed that the number of shortest 
paths through a random vertex is 

* £ = ' )= (g +7)^+3) - (49) 

where iV is the size of the network and I G N. Furthermore, the distribu- 
tion of the length of the shortest paths has been precisely calculated. The 
asymptotic limit of the distribution was proved to be normal with mean and 
variance increasing as logiV. 



4.2 The network model 

The concepts of graph theory are used throughout my analysis, so I will define 
briefly the terminology I use first. A graph consists of vertices (nodes) and 
edges (links). Edges are ordered or un-ordered pairs of vertices, depending 
on whether an ordered or un-ordered graph is considered, respectively. The 
order of a graph is the number of vertices it holds, while the degree of a 
vertex counts the number of edges adjacent to it. Path is also defined in 
the most natural way: it is a vertex sequence, in which any two consecutive 
elements form an edge. A path is called a simple path if none of the vertices 
in the path are repeated. Any two vertices in a tree can be connected by 
a unique simple path. The graph is called connected if for any vertex pair 
there exists a path which starts from one vertex and ends at the other. 

The construction of the network proceeds in discrete time steps. Let us 
denote time with r G N, and the developed graph with G T = (V T , E T ), where 
V T and E T denote the set of vertices and the set of edges at time step r, 
respectively. Initially, at r = 0, the graph consists only of a single vertex 
without any edges. Then, in every time step, a new vertex is connected to 
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the network with a single edge. The edge is directed, which emphasizes that 
the two sides of the edge are not symmetric. The newly connected node, 
which is the source of the edge, is always "younger" than the target node. 
The term "younger node of a link" is used in this sense below. Note that 
the initial vertex is different from all the others, since it has only incoming 
connections; I refer to it as the root vertex. 

The target of every new edge is selected randomly from the present ver- 
tices of the graph. The probability that a new vertex connects to an old one 
is proportional to the attractiveness of the old vertex v, defined as 

A(v) = a + q, (4.10) 

where parameter a > denotes the initial attractiveness and q is the in- 
degree of vertex v. It has been shown in [54| that the in-degree distribution 
is asymptotically P(g) ~ (1 + a) (q + a)~^ 2+a - > . I will improve this 

result and derive the exact in-degree distribution below. Note that in the 
special case a = the attractiveness of every node is zero except of the root 
vertex. It follows that every new vertex is connected to the initial vertex 
in this case, which corresponds to a star topology. The special case a = 1 
practically returns the original BA model. Indeed, except for the root vertex, 
the attractiveness of every vertex becomes equal to its degree if a = 1; this is 



exactly the definition of the attractiveness in the BA model [57J. Finally, if 
a — > oo, then preferential attachment disappears in the limit, and the model 
tends to a Poisson-type graph, similar to an ER graph. 

The attractiveness of sub-graph S is the sum of the attractiveness of its 
elements: 

A(S) = $>0O- (4.11) 

v'es 

I refer to a connected sub-graph as a cluster. The attractiveness of cluster C 
can be given easily: 

A(C) = (1 + a) \C\ - 1, (4.12) 

where |C| denotes the size of the cluster. It is obvious that the overall 
attractiveness of the network at time step r is 



A(V T ) = (l + a) (r + l)-l. 



(4.13) 
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4.3 Simulation of large computer networks 

Before I continue with the analytic study of betweenness I would like to illus- 
trate the effects of different capacity allocation strategies in large computer 
networks and demonstrate the importance of finding an optimum strategy. 
To this end I carried out large scale computer simulations. Since packet level 
simulations of large networks are practically impossible, because of their huge 
computational requirements, I implemented a fluid model of the network traf- 
fic based on the AIMD model, introduced below. 



Baccelli and Hong [72j have developed the AIMD model for N parallel TCP 
flows utilizing a common bottleneck buffer. The synchronization of the TCP 
flows could be tuned in the range of complete synchronization and complete 
randomness. The acronym AIMD stands for additive increase, multiplicative 
decrease. The name refers to the basic governing principle behind TCP 
congestion avoidance algorithm, and it emphasizes that the details of the 
slow start and the FR/FR algorithms are neglected in the model. Let T n 
denote the nth congestion time, r n+ i = T n+ i — T n the elapsed time between 

(i) 

two consecutive congestion events, and Xn the throughput of ith flow after 
the nth congestion event. If instantaneous throughput is approximated by 
its average, then the throughput can be related to the congestion window 
by the following equation: xjp = wjpP/R®, where R (i) is the RTT 
of the ith TCP flow, and P is the size of the data packets, as above. The 
evolution of the throughput can be given by 



where and (3^ are the linear growth rate and the multiplicative decrease 
factor of the congestion window, respectively, and t$ are random variables, 
independent in n, which take the value 1 if the ith TCP flow experiences 
packet loss at the nth congestion event, and otherwise. Congestion occurs 
at the bottleneck, supposing negligible or zero buffer capacity, when the total 
throughput reaches the capacity of the bottleneck link C. Accordingly, r n+ i 



4.3.1 The AIMD model 




(4.14) 
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can be calculated from following fluid equation: 

N , (*) P \ 

i=l ^ ' 

The variable r n+ i can be eliminated from (14.141) and (14.150 . which leads 

to 

where 7 « = (l - £») + and p« = E £^ (j) ■ 

recursive equations (14.161) can also be given in a simpler stochastic matrix 
form: 

X n+ i = A n+ i • X n + B n+1 , (4.17) 

where (B n ) i = 7^ p^C and (A n ) j . = 7 „ (5^ — pW) , and 8ij is the Kronecker- 
delta symbol. 

The interaction of the competing flows is taken into account by the syn- 

(i) 

Note that are not independent at a 




chronization rate, = E 



t(*) 



given n for z = 1 . . . N, since at a congestion event a minimum of one TCP 
flow must experience packet loss. If ^ are generated independently with 
P ($} = l) = ?Tn\ but those realizations are discarded where Y^hLi £n' ) = 0, 
then the synchronization rate can be expressed with the following conditional 
probability: 



^""'- i-ic'ci-^ - (4 ' 18) 



r{? = P Ui 4) = 1 



For the special case N = 1, for example, it is evident that r n = 1. 

Let us consider a simple homogeneous situation, where = a, (3^ = @, 
R® = R, and r ™' ) = r n- It is obvious that p^ — 1/N in this case. Moreover, 
it can easily be shown that the expectation of the steady state throughput 
is E [Xoo] = E [Boo], that is 

E[XW]=E[ 7 ]^ = [l-(l-/?)r]^ (4.19) 

for all i. The above formula predicts the degradation of the throughput as 
the synchronization grows. This is in good agreement with simulations and 
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measurements. The expected time between consecutive congestion events 
can also be obtained: 

C Rr 

E H = (1 -' 8 W- (4 - 20) 

The formula can be derived from the extended AIMD model as 

well. The functional form of the formula is 

x= pyvm. (4 . 21) 

R y/p 

The precise form of f(r, N) is rather complicated. However, it has been 
shown that liniAr^oo f(r, N) = 1 — r/4. This implies that for large N the 
constant factor Co = y/2f{r, N) varies in the range [y/3/2, V2] with the 
synchronization rate. 

The authors have presented a wavelet and an auto-correlation analysis 
for traces of the AIMD model for a large number of TCP connections. They 
concluded that the trajectory of the aggregated throughput shows multi- 
fractal scaling properties on short time scales and the wavelet and auto- 
correlation methods give consistent fractal dimensions. 

The single-buffer AIMD model can be generalized straightforwardly for 
numerical simulations of more complex networks. One only needs to apply 
(14.151) for each link and find the minimum of possible congestion events: 

n _ y(») 
r n+1 = mm — ^y^, (4.22) 



e&E 



where I e denotes the set of flows which utilizing link e G E. The flows of the 
congested buffer are handled the same way as in the original single buffer 
AIMD model. The remaining flows in the network develop undisturbed until 
the next possible congestion event. 



4.3.2 Performance of different bandwidth distribution 
strategies 

In this section different bandwidth distribution scenarios are compared using 
a fluid simulator based on the above AIMD model. The underlying network 
topology is the same in all scenarios: a scale-free network generated according 
to the extended BA model introduced in Section [4721 The parameter, which 
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Table 4.1: Link capacity and performance in case of different strategies. The 
assigned capacity is proportional to the quantity displayed in the second 
column, where q A , qB denote the in-degrees of the nodes which compose a 
particular link, and L e denotes edge betweenness. 



Strategy 


c e 


Q[b/s] 


Uniform 


oc 1 


740.79 


Maximum 


oc max(q A , q B ) 


2391.94 


Minimum 


oc min(g A , <7b) 


6574.69 


Product 


ocq A -qB 


5279.5 


Mean field 


oc L e 


11284.6 



controls the number of new links in the model, is set to m = 1, that is 
the resulting network is a tree. The scaling parameter is set to a = 1 for 
numerical purposes. In simulations link capacities are normalized in such a 
way that the average capacity is the same in all scenarios. 

The rules of different strategies are presented in Table 14. 1L The uniform 
scenario, when the capacity is the same for every link, is regarded as a ref- 
erence. It can be considered the worst case scenario, when no information is 
available about the details of the network. On the contrary, the mean field 
strategy — when the link capacity is proportional to the edge betweenness — is 
a global optimum. Minimum, maximum and product strategies are a couple 
of naive attempts to take the local structure of the network into account. 
Note that only one global information the normalizing factor for the average 
capacity is required. In the later three cases the more connection a link pos- 
sesses, the more capacity is allocated for the particular link. The difference 
between the three strategies is whether they prefer loosely, moderately or 
highly connected links, compared with the mean field allocation strategy. 

The capacity range that different strategies are more likely to prefer can 
be easily determined by the complementary distribution of capacities, shown 
in Fig. 14. 1L The average capacity is set to (C) = 10 5 [b/s] for all cases. The 
distribution of the uniform strategy is clearly degenerated since only one 
capacity value is possible in this scenario. The maximum strategy prefers 
the lower bandwidths at the cost of a cutoff at about 10 6 6/ s capacity. The 
minimum strategy also prefers lower bandwidths at the cost of high band- 
widths, but no cutoff exists. The complementary distribution of minimum 
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Link Capacity, C[b/s] 



Figure 4.1: Comparison of the complementary CDF of link capacity is shown 
for different bandwidth distribution strategies on log-log plot. Data is ob- 
tained from 10 realizations of N = 10 4 node networks. Average capacity is 
set to C = 10 5 [6/s] for every network. The following scenarios are considered: 
uniform (pentagons), maximum (diamonds), minimum (triangles), product 
(circles), and mean field (squares). 



strategy resembles the mean field distribution with a different scaling ex- 
ponent. The product strategy prefers the mid-range of bandwidth, and it 
underestimates both the low and the high capacity range, compared to the 
mean field strategy. 

In order to compare different strategies one needs an ordering between 
them. Performance, the average throughput of TCPs, provides a natural 
ordering between different strategies. Let us define the performance of indi- 
vidual TCPs first as the time average of their throughput X®(t): 

g(0 = (X (i) (t)\ = lim - / X®(u)du. (4.23) 

t->oo t Jq 

The global performance of a strategy is then the mean performance of the 
TCPs operating in the network 

Af TCp 

Q = m — ( 4 - 24 ) 



TCP 
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The locations of TCP sources and destinations are distributed homoge- 
neously in my numerical simulations. The length of a simulation is such 
that every TCP connection experiences 100 congestion epochs on average. 
Network performances obtained from simulations are shown in Table |4~T1 for 
the different bandwidth distribution strategies. The table shows that mean 
field bandwidth allocation strategy is almost twice as effective as the second, 
"minimum strategy", and it is more than twice as good as the "product strat- 
egy". The performance of a network with maximum bandwidth distribution 
strategy is about one fifth the performance of the same network when mean 
field strategy is used. Moreover, the performance of uniform scenario is even 
less then one third of the second worst, "maximum strategy". 

A more detailed picture can be gotten from the distribution of TCP- 
wise performance F(Q^). Simulation results of the cumulative distribution 
function (CDF) of TCP performance are shown in Figure 14.21 for the above 
mentioned bandwidth allocation strategies. The performance of mean field 
strategy is clearly the best. The bulk of the distribution is concentrated to 
a relatively narrow performance interval, that is most of TCPs can operate 
at almost the same, high performance level. The performance distribution 
of the next two best performing strategies, the minimum and the product, 
is very similar below their median. Above the median the minimum strat- 
egy performs better even though large capacity links are preferred less than 
the product strategy. It follows that the whole bandwidth range must be 
taken into consideration in any bandwidth distribution strategy to reach the 
optimum network performance. The performance of the maximum strategy 
is considerably worse than the previous two, mainly due to the sharp cutoff 
in the capacity distribution. Finally, the uniform bandwidth distribution is 
the worst of all: its performance is just a few percent of the mean field sce- 
nario's performance. The network where this strategy is applied is heavily 
congested, since the bottlenecks form in the core of the network. 

In summary, the selection of inadequate bandwidth allocation strategy 
can degrade the overall performance of the network considerably. In the fol- 
lowing sections I discuss analytically how additional local information could 
be used to allocate capacity to links properly. Beforehand, I introduce the 
network model investigated. 
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TCP performance, Q®[b/s] 

Figure 4.2: Comparison of the CDF of TCP performance is shown for dif- 
ferent bandwidth distribution strategies on normal-log plot. Data is ob- 
tained from 10 realizations of N = 10 4 node networks with scaling param- 
eter a = 1/2. Average capacity is set to C — 10 5 [6/s] for every network. 
Simulation lasted for 100iV congestion epochs. The following scenarios are 
considered: uniform (pentagons), maximum (diamonds), minimum (trian- 
gles), product (circles), and mean field (squares). 



4.4 Discussion 

It is my aim to derive the probability distribution distribution of edge be- 
tweenness in evolving scale-free trees, under the condition that the in-degree 
of the "younger" node of any randomly selected link is known. For the sake 
of simplicity I consider the in-degree of the "younger" node only. Whether 
a node is "younger" than another node or not can be defined uniquely in 
evolving networks, since nodes attach to the network sequentially. Note that 
the in-degree is considered instead of total degree for practical reasons only. 
The construction of the network implies that the in-degree is less than the 
total degree by one for every "younger" node. 

To obtain the desired conditional distribution I calculate the exact joint 
distribution of cluster size and in-degree for a specific link first. Then, the 
joint distribution of a randomly selected link is derived, which is comparable 
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Root 



Figure 4.3: Schematic illustration of the evolving network at time r. Vertex 
v, connected to the network at r e , denotes the root of cluster C. Variables q 
and n = \C\ — 1 denote the in-degree of vertex v and the number of nodes in 
C without v (marked by circles), respectively. 



with the edge ensemble statistics obtained from a network realization. The 
exact marginal distributions of cluster size and in-degree follow next. After 
that, I give the distribution and mean of cluster size under the condition 
that in-degree is known. For the sake of completeness the conditional in- 
degree distribution is presented as well. Finally, the distribution and mean 
of edge betweenness is derived under the condition that the corresponding 
in-degree is known. Note that all of my analytic results are exact even for 
finite networks, which is valuable since the real networks are often much 
smaller than the valid range of asymptotic formulas. Moreover, exact results 
for unbounded networks are provided as well. 

4.4.1 Master equation for the joint distribution of clus- 
ter size and in-degree 

Let us consider the size of the network N, an arbitrary edge e, which con- 
nected vertex v to the graph at time step r e > 0, and let us denote by C the 
cluster that has developed on vertex v until r > r e (Fig. 14.31) . The calculation 
of betweenness of the given edge is straightforward in trees, since the number 
of shortest paths going through the given edge, that is the betweenness of 
the edge, is obviously L = \C\ (N — \C\). Therefore, it is sufficient to know 
the size of the cluster on the particular edge to obtain edge betweenness. 
The development of cluster C can be regarded as a Markov process. The 
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W T , n>q = -^ri^- = _ , ., . (4.25) 



= "XTtTT = _ h , (4-26) 



states of the cluster are indexed by (n,q), where n — \C\ — 1 denotes the 
number of vertices in cluster C without v. The in-degree of vertex v is 
denoted by q. Transition probabilities can be obtained from the definition of 
preferential attachment: 

A (C T \v) n — aq 

A(V T ) ~ r + l-a 
A(v) aq + 1 — a 
A (V T ) ~ r + l-a 

where a = 1/ (1 + a) 6 ]0, 1] and W Ttn ^ q denotes the transition probability 
(n, q) — > (n + l,g), and W r q denotes the transition probability (n, g) — ► 
(n + 1, q + 1), respectively. 

The Master-equation, which describes the Markov process, follows from 
the fact that cluster C can develop to state (n, q) obviously in three ways: a 
new vertex can be connected 

1. to cluster C but not to vertex v, and the cluster was in state (n — 1, q), 

2. to vertex v, and the cluster was in state (n — 1, q — 1), or 

3. to the rest of the network, and the cluster was in state (n, q). 

Therefore, the conditional probability P r (n, q \ r e ) that the developed cluster 
on edge e is in state (n, q) satisfies the following Master-equation: 

f T (n,q | r e ) = W / T _i jn _i ;(? P T _i(n - l,q \ r e ) 
+ W;_ 1> ,_ 1 P T _ 1 (n-l,g-l |r e ) 
+ [1 - W^_i, n , g - W' T _ hq ] P T _i(n, g | r e ), (4.27) 

Since the process starts with n = 0,g = at r = r e , the initial condition 
of the above Master equation is f Te { n , Q \ r e ) = S n)0 5 qj0 , where 8^ is the 
Kronecker-delta symbol. 

4.4.2 The solution of the master equation 

After substituting the above transition probabilities into (14.271) . the following 
first order linear partial difference equation is obtained: 

(r - a) P r (n, q \ r e ) = (n - 1 - aq) P T _i(n - 1, q \ r e ) 

+ (aq + 1 - 2a) P T _i(n - 1, q - 1 | r e ) 

+ (r — n — 1) P T _i(n, g | r e ), (4.28) 
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Let us seek a particular solution of (14.281) in product form: /(r) g(n) h(q). 
The following equation is obtained after substituting the probe function into 
(14281) : 

f ^ f^ i 1 \9{n-l) 

{T -a) JF - T) -T=(n-l-a q )—^ r -n-l 

+ (a g + l-2a) ^^- 1) . (4-29) 

#(n) %) 

The above partial difference equation can be separated into a system of three 
ordinary difference equations. The solutions of the separated equations are: 

T(q+ 1/a- 1) , 

- r ( ; + A 2 / a + 1 V < 4 ' 32 > 

where Ai and A2 are separation parameters. 

The solution of (14.27p . which fulfills the initial conditions, is constructed 
from the linear combination of the above particular solutions: 



n 



q\r e )=J2 C ^M fif) g(n) h(q), (4.33) 



A1.A2 



where C\ lt \ 2 coefficients are independent of r, n and q. 

To obtain coefficients C\ lt \ 2 , the initial condition of ( 14.271 ) is expanded 
on the bases of g(n) and h(q). The detailed calculation is presented in Ap- 
pendix IA.21 

The solution of (14271) is 

rfr-r. + n T(r-n) 

F T (n,q I r e 



T(r e ) T(n + l)T(r-r e ~n + V 



1 (r + 1 — a) 1 — 1) 



where $ a (n, g) = J2l=o kl( q -k)\ (~ ak )n and ( x )n = T(n + x)/V(x) denotes 
Pochhammer's symbol. Note that P r (n, q | r e ) 7^ iff < q < n < r — r e . 
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The conditions < q and n < r — r e are obvious, since l/T(k) = by 
definition if A; is a negative integer or zero. Furthermore, the condition q < n 
can easily be seen if § a (n,q) is transformed into the following equivalent 
form: & a (n, q) = ■^■j^z n ~ 1 (1 — z~ a ) q \ z _ r This result coincides with the fact 
that the size of a cluster n cannot be less than the corresponding number of 
in-degrees q. 

4.4.3 Joint distribution of cluster size and in-degree 

Equation (14.341) provides the conditional probability that a particular edge 
which was connected to the network at r e is in state (n, q) at r > r e . In a 
fully developed network, however, the time when a particular edge is con- 
nected to the network is usually not known. Moreover, the development of 
an individual link is usually not as important as the properties of the link 
ensemble when it has finally developed. Therefore, we are more interested in 
the total probability P T (ra, q), that is the probability that a randomly selected 
edge is in state (n, q) at r, than the conditional probability (14.341) . The total 
probability can be calculated with the help of the total probability theorem: 

T 

F T (n, q) = Y, F r{n, q | r e ) P T (r e ), (4.35) 

r e = l 

where P T (r e ) is the probability that a randomly selected edge was included 
into the network at r e . According to the construction of the network one 
edge is added to the network at every time step, therefore P r (r e ) = 1/r. 
The following formula can be obtained after the above summation has been 
carried out: 

t + 1 — a (1/a — 1)„ 
P T (n, q) = — ' r >q $ Q (n, q), (4.36) 

T ( 2 - a )n+l 

where < a < 1. In star topology, that is when a = 1, the joint distribution 
F T (n,q) evidently degenerates to F T (n,q) = 5 n ,o ^o- 

The ER limit of joint distribution can be obtained via the a — > limit of 
(14.361) (see Appendix IA.3I for details): 

n-l ( k \ o(k) 

£j p T(n ,, ) = I±l E( _ 1) »«- l ^^ (4 .37, 



4.4. DISCUSSION 



91 




1 10 100 

Cluster size, n 



(a) Joint distribution of cluster size and in- 
degree as the function of cluster size. 




1 10 100 

In-degree, q 



(b) Joint distribution of cluster size and in- 
degree as the function of in-degree. 



Figure 4.4: Joint empirical distribution of cluster size and in-degree at a = 
1/2 (symbols), and analytic formula (14.361) (solid lines) are compared on 
double-logarithmic plot. Simulation results have been obtained from 100 
realizations of 10 5 size networks. 

where < q < n < r and Sn^ denote the Stirling numbers of the first kind. 
Note that for the special case n = q = the ER limit is lim Q ^ Pt(0, 0) = 

The above formulas have been verified by extensive numerical simulations. 
The joint empirical cluster size and in-degree distribution has been compared 



with the analytic formula ( 14.361 ) for a = 1/2 in Fig I4.4L Figures 4.4(a) 



and 4.4(b) represent intersections of the joint distribution with cutting planes 
of fixed in-degrees and cluster sizes, respectively. The figures confirm that the 
empirical distributions, obtained as relative frequencies of links with cluster 
size n and in-degree q in 100 network realizations, are in complete agreement 
with the derived analytic results. 

Equation (I4.36H is the fundamental result of this section. The derived 
distribution is exact for any finite value of r, that is for any finite BA trees. 
This result is valuable for modeling a number of real networks where the 
size of the network is small compared to the relevant range of cluster size or 
in-degree. If the size of the network is much larger than the relevant range 
of cluster size or in-degree then it is practical to consider the network as 
infinitely large, that is to take the r — > oo limit. For the above joint distri- 
butions ( 14.361 ) and (14.371) the r — ► oo limit is evident, since the r dependent 
prefactors obviously tend to 1 if the size of the networks grows beyond every 
limit. 
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4.4.4 Distributions of cluster size and in-degree 

I have derived the joint probability distribution of the cluster size and the 
in-degree in the previous section. In many cases it is sufficient to know the 
probability distribution of only one random variable, since the information 
on the other variable is either unavailable or not needed. It is also possible 
that the one dimensional distribution is especially necessary, for example for 
the calculation of a conditional distribution in Section 14.4.51 

The one dimensional (marginal) distributions P T (n) and F T (q) can be 
obtained from joint distribution P r (n, q) as follows: 

n t— 1 

P T (n) = ^P r (ri,g), P T (g) = £> T (n,g). 

g=0 n=q 

After substituting (14.361) into the above formulas the following expressions 
are obtained: 

m , x t + 1 — a 1 — a 

P T (n) = r- r. 4.38 

if < n < t and P r (n) = if n > r. Furthermore, 
r + l-al (l/a-l) 1/Q 



Pr(?) 



t a (q + 1/a - l) 1/a+1 

r+l-a(l/a-l) q ^ (-l) fc (-afc) T 

r (2-a)r ^ A;!(g- A;)!aA; + 2-a' l ' j 



if < q < t and F T (q) = otherwise. Rice's method [73j| has been applied 
to evaluate the first term of F T (q) in closed form. 

The ER limit of the marginal cluster size distribution can obviously be 
obtained from (14.381) at a = 0. Furthermore, the ER limit of the marginal 
in-degree distribution can be derived analogously to the limit of the joint 
distribution, shown in Appendix IA.3t 



limP T ( 5 ) T + 1 1 ■ T + 1 1 (1 + Q) - 



a^o ^' t 29+ 1 r r(r + 2)T(q)dai- 1 2- a 



(4.40) 



a=0 



If the size of the network grows beyond every limit, that is if r — > oo, 
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then the marginal distributions become much simpler: 

p ~(«> = (n + 1 -lH n a +2 - a ) < 4 ' 41) 
r „ lq) = 1 (4 . 42) 

a [1+ ya- l)i/a+i 
limPoo(g) = (4.43) 

The asymptotic behavior of the cluster size and in-degree distributions 
differ significantly. The tail of the cluster size distribution follows power law 
with exponent 2 either in BA or ER network, independently of a. However, 
we learned that the tail of the in-degree distribution follows power law with 
exponent l/a + l = 2 + ain BA networks, and it falls exponentially in ER 
topology, which agrees with the well known results of previous works [56| . 

It is worth noting that the mean cluster size diverges logarithmically as 
the size of the network tends to infinity: 

T-l 

E T [n) = ^nP r (fj) = (1 -a)lnr + C(l). (4.44) 

rc=0 

The expectation value of the in-degree, however, obviously remains finite: 
[ol = 7+i < 1> an d [q] = 1 if the size of the network is infinite. 
Moreover, the variance of the in-degree can also be given exactly when the 
size of the network grows beyond every limit: 

[(q - l) 2 ] = (4.45) 

This result implies that the fluctuations of the in-degree diverge in a bound- 
less network, if a = 1/2, that is in the classical BA model. 

My analytic results have been verified with computer simulations. Since 
cumulative distributions are more suitable to be compared with simulations 
than ordinary distributions I matched the corresponding complementary cu- 
mulative distribution function (CCDF) against simulation data. The CCDF 
of cluster size, F T (n) = Y7r7=n ^r{n') can be calculated straightforwardly: 

-,. r+1 — a 1 — a 1 — a 

F T (n) = — , 4.46 

r n + 1 — a t 
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Cluster size, n 



Figure 4.5: Figure shows comparison of empirical CCDFs of cluster size dis- 
tributions (points) with analytic formula (14.461) (lines) on logarithmic plots, 
at a = 0, and 1/2. Empirical distributions have been obtained from 10 
realizations of N = 10 6 size networks. 



where < n < r and < a < 1. The CCDF of in-degree, F T (q) 



^L = 1 „P T (? / ) is more complex, however: 



F ( \ = r + 1 ~ a (^"^lA* _ 1-Q 
r (g+l/a-l) l/a r 

r+l-a(l/a-l) g ^ (-l) k (1 - a - afc)^ 

r (2-a) T ^fc!(g-2-fc)!(A; + l/a)(fc + 2/a) l ' ' 

where < q < r and < a < 1. If the size of the network grows beyond 
every limit, then the CCDFs are the following: 

Foo{n) = j— ? , Foo(g) = k/ 7" , (4.48) 

ra+l-a (g + l/a-l) 1/a 

where < n, < q and < a < 1. 

Comparisons of analytic CCDF of cluster size (14.461) and empirical distri- 
butions are shown in Figure [4751 for a = 0, 1/3, 1/2, and 2/3. Experimental 
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Figure 4.6: Figure shows comparison of empirical CCDFs of in-degree distri- 
butions (points) with analytic formula ( 14.47j) (lines) on logarithmic plots, at 
a = 0, 1/3, 1/2, and 2/3. Empirical distributions have been obtained from 
10 realizations of N = 10 6 size networks. Inset: Comparison at a = on 
semi-logarithmic plot. 

data has been collected from 10 realizations of 10 6 node networks. Figure [4751 
shows that simulations fully confirm my analytic result. 

On Figure 14.61 analytic formula (I4.47K and the empirical CCDFs of in- 
degree, obtained from the same 10 6 node realizations, are compared. Note 
the precise match of the simulation and the theoretical distribution on almost 
the whole range of data. Some small discrepancy can be observed around 
the low probability events. This deviation is caused by the aggregation of 
errors on the cumulative distribution when some rare event occurs in a finite 
network. 

4.4.5 Conditional probabilities and expectation values 

In the previous sections exact joint and marginal distributions of cluster size 
and in-degree were analyzed for both finite and infinite networks. All these 
distributions provide general statistics of the network. In this section I pro- 
ceed further, and I investigate the scenario when the "younger" in-degree of 
a randomly selected link is known. I ask the cluster size distribution under 
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this condition, that is the conditional distribution F T (n \ q). The results of 
the previous sections are referred to below to obtain the conditional proba- 
bility distribution, and eventually the conditional expectation of cluster size. 
For the sake of completeness, the conditional distribution and expectation of 
in-degree are also given at the end of this section. 

The conditional cluster size distribution can be given by the quotient of 
the joint and the marginal in-degree distributions by definition: 

p ^<" I *) = W' (4 ' 49) 

The exact conditional distribution for any finite network can be obtained 
after substituting (14.360 and (14.391) into the above expression. For a boundless 
network the conditional distribution takes the simpler form: 

(2/a-l) 0+1 

Poo(n | q) = a-j- r^±i$„(n, q), (4.50) 

( 2 - a )n+i 

where < q < n. If n » 1, then P DO (n | q) ~ a (2/a - l) q+1 /n 3 + O (l/^ 4 ), 
that is the conditional cluster size distribution falls faster than the ordinary 
cluster size distribution. It follows that the mean of the conditional cluster 
size distribution will not diverge like the mean of the ordinary distribution. 

What is the expected size of a cluster under the condition that the in- 
degree of its root is known? For practical reasons, I do not calculate E r [n \ q] 
directly, but I calculate E T [n + 2 — a \ q] = E T [n \ q] + 2 — a instead: 

1 T_1 

E T [n + 2-a | q] = — ^ (n + 2 - a) P T (n, q). (4.51) 

Since (n + 2 — a) P r (n, q) = T+ ]r a ^(l^_a) q ®a(n, q), the above summation can 
be given similarly to the marginal distribution F T (q) in (14.391) : 



T-l 

E 



n + 2 - a) P T (n, q) 



r + 1 — a l/a — 1 



n=q 

T 



r q + l/a — 1 
+ l-a(l/a-l),^ (-1)" {-ak) T 



t (2-a) T _ 1 ^ Q k\(q-k)\ak + l-a 



4.4. DISCUSSION 



97 



After replacing the above sum in E T [n | q] , the following equation can be 
obtained: 

(q + 1/a), , 

E T [ n + 2 - a | q] = (1 - a) — r^G T (g), (4.52) 



where 



(l/a-l) g « (-l) fc (-a*), 



(1-a) ^ fc!(g-jfc)!jfc + l/a-l 

Gt W = 7777 7\ 5 — fc ; — rr • 4 -° 3 

(2/Q - A (-1) (~afc) r 

( 2 -«)r ^ Q k\(q- k)\k + 2/a-l 

The identity linx^oo G T (q) = 1 implies that G T (q) involves the finite scale ef- 
fects, and the factors preceding G T (q) give the asymptotic form of E T [n + 2 — a 

(q + 1/a)-, / 

Eoo [n + 2 - a | g] = (1 - a) }- - V " . (4.54) 

(1/a - l) 1/a 

It can be seen that the expectation of cluster size, under the condition that 
the in-degree is known, is finite in an unbounded network. This stands in 
contrast to the unconditional cluster size, discussed in the previous section, 
which diverges logarithmically as the size of the network grows beyond every 
limit. 

In the ER limit the expected conditional cluster size becomes 

limEoo [n + 2\q}= 2 q+1 . (4.55) 

The fundamental difference between the scale-free and non-scale-free net- 
works can be observed again. In the scale-free case the expected conditional 
cluster size asymptotically grows with the in-degree to the power of 1/a, 
while in the latter case it grows exponentially. On Figure 14.71 the exact ana- 
lytic formula ( 14.521) is compared with simulation results at a = 0, 1/3, 1/2, 
and 2/3. The simulations clearly justify my analytic solution. 

Let us briefly investigate the opposite scenario, that is when the cluster 
size is known and the statistics of the in-degree are sought under this condi- 
tion. The conditional distribution can be obtained from the combination of 
Eqs. (1431) . (14381) and the definition 



(4.56) 
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In-degree, q 



Figure 4.7: Figure shows the average cluster size as the function of the in- 
degree q, obtained from 100 realizations of 10 5 size networks. Simulation data 
has been collected at a = 0, 1/3, 1/2, and 2/3 parameter values. Analytical 
result (I4.52D of conditional expectation E r [n \ q] is shown with continuous 
lines. 

The conditional expectation of in-degree can be acquired by the same tech- 
nique as the conditional expectation of cluster size. Let us calculate 

E T [q + l/a - 1 | n) = E T [q\n) + l/a - 1 (4.57) 

instead of E r [q \ n] directly: 

1 n 

E T [q + l/a - 1 | n] = -— - (q + l/a - 1) P T (n, q) 

T ^ n ) q=0 

r(2-a) , . . . 

= n + 1 - a) a , (4.58) 

a 

where < n < r. Note that the conditional expectation of in-degree is 
independent of r, that is of the size of the network. In the ER limit the 
expectation of the in-degree becomes 

lim E r [q\n} = *(n + 1) + 7, (4.59) 
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Figure 4.8: Figure shows the average in-degree as the function of the cluster 
size n, obtained from 100 realizations of 10 5 size networks. Simulation data 
has been collected at a = 0, 1/3, 1/2, and 2/3 parameter values. Analytical 
result (14.581) of conditional expectation E T [q \ n] is shown with continuous 
lines. 

where ^(x) = ^lnr(x) denotes the digamma function, and 7 = — ^(1) ~ 
0.5772 is the Euler-Mascheroni constant. Asymptotically the expectation of 
the in-degree in a scale-free tree grows with the cluster size to the power 
of a, while in a ER tree it grows only logarithmically, since ^/(n + 1) = 
logn-l- O (1/n). Therefore, conditional in-degree and conditional cluster size 
are mutually inverses asymptotically. Figure 14.81 shows the analytic solution 
(14.581) and simulation data at a = 0, 1/3, 1/2, and 2/3 parameter values. 
Simulation data has been collected from 100 realizations of 10 5 size networks. 



4.4.6 Conditional distribution of edge betweenness 

Using the results of the previous sections, I am finally ready to answer the 
problem which motivated my work, that is the distribution of the edge be- 
tweenness under the condition that the in-degree of the "younger" node of 
the link is known. As I noted at the beginning of Section 14.4.11 the edge 
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betweenness can be expressed with cluster size: 

L= (n + 1) (r-n) . (4.60) 

Therefore, conditional edge betweenness can be given formally by the follow- 
ing transformation of random variable n: 

r-l 

P T (L \q) = J2 W)(r-n) p r(n | q). (4.61) 

n=0 

Obviously, f T (L \ q) is non-zero only at those values of L, where (14.601) has 
an integer solution for n. If 



r-l /(r + 1) 2 , . 

»L = — -\l^- r L -L (4-62) 

is such an integer solution of the quadratic equation ( 14.601) . and L ^ (r + l) 2 /4, 
then 

P T (L | q) = F T {n L \ q) + P r (r - 1 - n L \ q). (4.63) 

If L = (r + l) 2 /4 is integer, then P T (L | q) = ¥ T (n L \ q). 

The conditional expectation of edge betweenness can be obtained from 
(HO]): 

E T [L | q] = tE t [n + 1 | q) - E r \{n + 1) n \ q] . (4.64) 

Therefore, for the exact calculation of E r [L \ q] the first and the second 
moment of the conditional cluster size distribution are required. The first 
moment, that is the mean, has been derived in the previous section. In order 
to calculate the second moment let us use the technique I have developed in 
the previous sections. Let us consider: 

to u w _li M i r + l-a{l/a-l) q ^ $ a (n,g) 
Er [(n + 2-a)(n + l-a)|g] = — 

(4.65) 

We shall be cautious when the summation for n is evaluated. The k = 1 
term in $ a (n, q) = Yll=o J(g-fc)! ( — a ^)n mus t be treated separately to avoid 
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a divergent term: 



T-l 



n=g ^ '71— 1 ^ ' 

1 i (~i) fc (-qfc) r 

« (2 -«) T _ 2 ^ *!(?-*)! fc-1 

The exact formula for E T [L \ q] can be obtained straightforwardly, after 
(14.521) and the above expressions have been substituted into ( 14.641) . 

Let us consider the scenario when the size of the network tends to in- 
finity. Equation (14.601) implies that edge betweenness diverges as r — > oo, 
therefore L should be rescaled for an infinite network. From the asymp- 
totics of the digamma function \?(t — a) = lnr + 0{l/r) it follows that 
E T [(n + 2 — a) (n + 1 — a) \ q] grows only logarithmically, slower than the 
linear growth of rE T [n + 2 — a \ q}. Therefore, edge betweenness asymptoti- 
cally grows linearly as the size of the network grows beyond every limit. Let 
us rescale edge betweenness 

A r = i& (4.66) 

T + 1 

and let us consider the limit A = lim^oo A r = n\ + 1. The CCDF of the 
rescaled edge betweenness can be given by 

T— l-«A r ^ oo 

F 00 (A|g) = Jim £ Pr ( n | g ) = __ ^ Poo ( n , g ). ( 4 .67) 



n=«A T v ' n=A— 1 



Poo(g) 



When the summation has been carried out, the following equation is ob- 
tained: 

(2/a-l) 0+1 * (-l) fe (-aife) A , 

where g + 1 < A. If 1 < g A, then only the first term of the sum should 
be taken into account, and it is easy to see that 

It can be seen that the scaling exponent —2 is independent of a. The above 
asymptotic formula has been obtained for infinite networks. The same power 
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Rescaled edge betweenness, A 



Figure 4.9: Figure shows CCDF of edge betweenness under the condition 
that the in-degree q is known. Empirical CCDF has been obtained from 100 
realizations of N = 10 4 and N = 10 5 , and 10 realizations of N = 10 6 size 
networks at a = 1/2 parameter value. Continuous lines show analytic result 
of infinite network limit (14.681) . 

law scaling can be observed in finite size networks as (14.691) if A T <C r. 
However, -F T (A T | q) = if A r > r in finite networks, therefore asymptotic 
formula ( 14.691 ) evidently becomes invalid if A r « r. 

It is obvious that as the size of the network grows larger and larger, 
asymptotic formula (14.681) becomes more and more accurate. One can ask 
how fast this convergence is. From elementary estimations of -F T (A T | q) one 
can show that for fixed A T : 

F T (A T | q) = F^K | q) - (l - Foo(A r | q)) ^"^ l + O (l/r 2+a ) , 

(4.70) 

that is corrections to the asymptotic formula decrease with r~ 2 for large r. 

On Figure 14.91 comparison of analytic formula (14.681) with simulation re- 
sults is presented for q = 1 and q = 2. The empirical CCDF of rescaled edge 
betweenness, under the condition that in-degree q is known, is shown for 
10 4 , 10 5 , and 10 6 size networks, at a = 1/2 parameter value. The empirical 
CCDFs of rescaled edge betweenness evidently collapse to the same curve for 
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In-degree, q 



Figure 4.10: Figure shows average edge betweenness under the condition that 
the in-degree q is known as the function of q on log-log plot. Numerical data 
has been collected from 100 realizations of N — 10 4 and N = 10 5 , and 10 
realizations of N = 10 6 size networks at a = 1/2 parameter value. Inset 
shows the same scenario at a = parameter value on semi-logarithmic plot. 
Continuous lines show analytic results of the infinite network limit (14.711) and 
((422!). 



different size networks, and they coincide precisely with my analytic result. 

The expectation of the rescaled edge betweenness under the condition 
that in-degree q is known can be given by E^ [A | q] — Eoo [n\ + 1 | q] ■ Using 
(14.541) and (14.551) I receive 

Eoo [A I q] = (1 - a) y \ 1/a - 1 + a, (4.71) 

{l/a-l) 1/a 

lim Eoo [A | q\ = 2 q+1 - 1. (4.72) 

One can see that Eoo [A | q] ~ Q l/a for q > 1 if a > and E^ [A | q\ ~ e q for 
q > 1 if a -»• 0. 

Analytic results ( 14.711 ) and (14.72h . and simulation data are shown in Fig- 
ure [4?T0] at a = 1/2 and a = parameter values. Numerical data has been 
collected from the same 10 4 , 10 5 , and 10 6 size networks as above. As the 
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size of the network grows a larger and larger range of the rescaled empirical 
data collapses to the same analytic curve. On the high degree region some 
discrepancy can be observed due to the finite scale effects. 

Finally, let us note that the precise unconditional distribution of edge 
betweenness P r (L) = J2n=o ^i,(n+i)(r-n)Pr(^) can be obtained from (14.381) 
as well. Furthermore, CCDF of the unconditional betweenness F T (L) = 
^^Z"^ 1 P r (^) can be derived in closed form: 

K(L) = 1±1^ (l-a)(r-2n L ) 

rV ; r (n L + l-a)(r-n L + l-a) v ; 

For the sake of simplicity I have assumed during my calculations that in- 
degrees of the "younger" nodes are provided. However, it is possible that even 
though both in-degrees of every link are known, we cannot distinguish them 
from each other, that is we cannot tell which is the "younger" node. How 
could I extend my results to this scenario? Let us consider a new edge when 
it is connected to the network. The in-degree of the new node is obviously 
0. The in-degree of the other node, to which the new node is connected, is 
equal to or larger than one. Due to preferential attachment the larger the 
in-degree is the faster it grows. Even if preferential attachment is absent, 
the growth rate of every in-degree is the same. Therefore, it is expected that 
the initial deficit in the in-degree of the "younger" node grows or remains at 
the same level during the evolution of the network. It follows that it is a 
reasonable approximation to substitute the in-degree of the "younger" node 
q with q min = min(gi, q 2 ) in my formulas. 



4.5 Conclusions 

A typical network construction problem is to design network infrastructure 
without wasting precious resources at places where not needed. An appro- 
priate design strategy is to allocate network resources proportionally to the 
expected traffic. In a mean field approximation the expected traffic is pro- 
portional to the number of shortest paths going through a certain network 
element, that is the betweenness. 

The precise calculation of all the betweenness requires complete informa- 
tion on the network structure. In real life, however, the number of shortest 
paths is often impossible to tell because the structure of the network is not 



4.5. CONCLUSIONS 



105 



fully known. One of the practical results of this chapter is that the expecta- 
tion of edge betweenness can be estimated precisely when only limited local 
information on network structure — the in-degree of the "younger" node — is 
available. 

Another difficulty of network design is that the size of real networks is 
finite. Moreover, the size of real networks is often so small that asymptotic 
formulas can be applied only with unacceptable error. The other important 
novelty of my results is that the derived formulas are exact even for finite 
networks, which allows better design of realistic finite size networks. 

Various statistical properties of evolving random trees have been investi- 
gated in this chapter. I have focused on the cluster size, the in-degree and 
the edge betweenness. I have considered the m — 1 case of the BA model 
extended with initial attractiveness for modeling random trees. Initial at- 
tractiveness allows fine tuning of the scaling parameter. Moreover, in the 
limit of the tuning parameter a — > the applied model tends to a non-scale- 
free structure, which is in many aspects similar to the classical ER model. 
I was therefore able to investigate both the scale-free and the non-scale-free 
scenario within the same framework. 

I also presented conditional expectations of cluster size and in-degree for 
both finite and unbounded networks. I have found that asymptotically the 
conditional cluster size grows with in-degree to the power of 1/a and the con- 
ditional in-degree grows with cluster size to the power of a, respectively. The 
ER limit has been discussed as well. I have shown that the conditional cluster 
size grows exponentially and the conditional in-degree grows logarithmically 
when a — > 0. 

I have derived the distribution of edge betweenness under the condition 
that the corresponding in-degree is known. I have found that the condi- 
tional expectation of edge betweenness grows linearly with the size of the 
network. For the analysis of unbounded networks I have defined the rescaled 
edge betweenness A, and derived its distribution and expectation under the 
condition that in-degree q is provided. My analytic results have been veri- 
fied at different network sizes and parameter values by extensive numerical 
simulations. I have demonstrated that numerical simulations fully confirm 
my analytic results. 
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Chapter 5 

Concluding remarks 



The study of complex networks has evolved considerably in recent years. 
An interesting example of complex networks is the Internet, which has be- 
come part of everyday life. Two important aspects of the Internet, namely 
the properties of its topology and the characteristics of its data traffic, have 
attracted growing attention of the physics community. My thesis has consid- 
ered problems of both aspects. 

In the introduction I briefly presented an overview of the basic compo- 
nents of Internet structure and traffic. The workings of the Transmission 
Control Protocol (TCP), the primary algorithm governing traffic in the cur- 
rent Internet, were discussed in more detail, since they are the main focus of 
the first part of my analysis. Most of the terminologies I use in this thesis 
were also defined here. 

In the next chapter I studied the stochastic behavior of TCP in an ele- 
mentary network scenario consisting of a standalone infinite-sized buffer and 
an access link. This simple model might constitute the building blocks of 
more complex Internet traffic. I calculated the stationary distribution of the 
stochastic congestion window process, which regulates the traffic of TCP. 
My analysis not only considered the ideal congestion window dynamics, but 
also included the effect of the fast recovery and fast retransmission (FR/FR) 
algorithms of TCP. Furthermore, I showed that my model can be extended 
further analytically to involve the effect of link propagation delay, charac- 
teristic of Wide Area Networks. Various moments of the congestion window 
process were calculated. An important achievement is that all the parameters 
are at hand in the entire model, and no parameter fitting is necessary. I also 
applied the mean field approximation to describe many parallel TCP flows. 
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My analytic results were validated against packet level numerical simula- 
tions and the simulations agreed to a high degree with the analytic formulas 
I derived. 

I continued my thesis with the investigation of finite-sized semi-bottleneck 
buffers, where packets can be dropped not only at the link, but also at the 
buffer. I demonstrated that the behavior of the system depends only on a 
certain combination of the parameters. Moreover, an analytic formula was 
derived that gives the ratio of packet loss rate at the buffer to the total packet 
loss rate. This formula makes it possible to treat buffer-losses as if they were 
link-losses. I considered the effect of the FR/FR algorithms and I calculated 
the probability distribution of the congestion window for both Local and 
Wide Area Network scenarios. I showed that a sharp peak might appear 
in the window distribution due to the FR/FR mode of TCP. My analytical 
results matched numerical simulations properly in the case of large buffer 
sizes and small packet loss probabilities. In the opposite range of parameters, 
however, the slow start mechanism of TCP plays a more important role and it 
cannot be neglected completely from the precise description of the congestion 
window dynamics. Nevertheless, my calculations gave qualitatively correct 
results in these well. Hopefully, my methods, developed in this 

chapter, can be applied later for modeling the slow start mechanism. 

In the last part of my thesis I studied computer networks from a structural 
perspective. The scaling exponent of the node connectivity could be tuned in 
the network model that I investigated. In addition, the non-scale-free limit 
of the node connectivity could also be investigated. I demonstrated through 
fluid simulations that the distribution of resources, specifically the link band- 
width, has a serious impact on the global performance of a computer network. 
Then I analyzed the distribution of edge betweenness in a growing scale-free 
tree under the condition that a local property, the in-degree of the "younger" 
node of an arbitrary edge, is known in order to find an optimum distribution 
of link capacity. The derived formula is exact even for finite-sized networks. 
I also calculated the conditional expectation of edge betweenness, rescaled 
for infinite networks. My analytic results were compared to numerical simu- 
lations that confirmed my calculations appropriately. 
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Mathematical proofs of the 
applied identities 



A.l Series expansion of L(c)G(x) 

In this section the series expansion of L(c)G(x) in x is derived, where L(c) 
is defined in (|2.30l) and G(x) in (13.231) . We prove two lemmas first: 
Lemma 1 For c 6 1, c ^ 1 

n k n 1 

k=0 1=1 k=l 



Proof. We prove the lemma by induction for N. Indeed, for iV = 1 the 
formula is evidently true: 1 + = ]~> ^ ne induction hypothesis suppose 
that the formula is true for N. Then 
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Lemma 2 For c G [0, 1[ 

n— 1 oo oo 

ri( i - cfc )=E cfcn n c 1 -^)- ( a - 3 ) 



k=l k=0 l=k+l 



Proof. This lemma is proven by induction for n. For n = 1 Lemma 1 proves 
the formula. Indeed, multiply both sides of (1A.1I) by rijfc=i (l — c fc ). If c G 
[0, 1[ then we can take the iV — > oo limit, which provides exactly the formula 
to be proven. 

As the induction hypothesis let us suppose that the formula is true for n. 
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(A.4) 

which proves the formula for n + 1. □ 
Theorem For iGK anc? c G [0, 1[ 

oo 1 n 

L(c)G(x) = -^-n(l-c')^- (A.5) 

n=l ' |=1 



Proof. From the series expansion of the exponential function it follows that 
E(x) = e~ cx — e~ x = — "52nLi (~ 1)™ ^r^"' Let us substitute this expression 
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into the definition of G(x) = YlkLo^^^ ^) 11*1=1 TZ? : 
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where we applied the Lemma 2 in the last equation in order to prove the 
theorem. □ 

A. 2 Expansion of the Kronecker-delta function 

We have seen that the general solution of Eq. (14.271) is 

Pr(n, q\r e )=J2 C ^M f{r) g{n) %), (A.7) 

Ai,A2 

and the initial condition is P Te ( n , <? I T e) = ^n,o^q,o? where 

^ m ={ lj ifn = m ' (A.8) 
10, if n 7^ m 

is the Kronecker-delta function, and n and m are integers. Coefficients C\ lt \ 2 
are calculated in this section. First we show that 

fc=0 v ' 

Note that we can consider m = without any loss of generality, since 5 n>m = 

3n— m,0 ■ 

If n < 0, then the summand in (IA.9I ) is indeed zero by definition. If n > 0, 
then 
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follows from the binomial theorem. Finally, for n = 



^ (-if i _ (-i) u i 
^ k\ r(-fc + i) _ o! r(i) ~ ' ( ' > 

k=0 \ ' / \ ) 

Coefficients C\ lt \ 2 can be obtained from the term by term comparison of 
F Te (n, q | r e ) — \ 2 C\ lt \ 2 f{r e ) g{n) h(q) with the expansion of the initial 
condition 5 nj o$ 9 ,0) shown above. One can easily confirm with the help of 
identity f(n)S nt0 = f(0)d n> o that the same terms appear on both sides, if 
Ai = — ki, and A2 = — ak 2 , and coefficients Ck ly k 2 are the following: 

r (-i) fcl+fc2 r(r e + i- Q ) 1 1 

fcl ' fc2_ k x \k 2 \ T( Te -h) r(-aJfe 2 )r(l/a-l)' [ ' 

Finally, to obtain (I4.36P the summation for k\ can be carried out explic- 
itly: 

» (-l) fcl r(r-fci) = r(r-r e + l) T(r-n) 

k^o W T ( n -^1 + 1) T(r e - h) T(n + l)r(r e ) T(r - r e - n + 1) 

A. 3 The a — ► limit of joint distribution P r (n, q) 

In this section we prove that the ER limit of the joint probability P r (n, q) is 
(14371) . 



Theorem Ze£ us consider ¥ T (n,q) as defined in U-36\ ), where < q < 
n < r are integers. Then the following limit holds: 



k=g-l 



Proof. First, let us note that $ a (n,q) in (14.36H can be rewritten in the fol- 
lowing equivalent form: Q> a (n, q) = a Ylk=o ' ^J^-"-!)?" -1 ■ Next, Pochham- 
mer's symbol (1/a — 1) is replaced with its asymptotic form: (1/ot — 1) = 
1/a 9 (1 + O (a)). After the obvious limits have been evaluated the following 
equation is obtained: 

v-g-l (- 1 ) fc ( 1 - Q - Qfc )n-i 

hm P T (n, o) = —^7 r hm . (A. 14) 
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The above limit, by definition, can be substituted with q — 1 order differ- 
ential at a = 0, if all the lower order derivates of the sum are zero at a — 0. 
Indeed, 
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where the sum is if m < q — 1 and 1 if m — q — 1. Therefore, the limit can 
be transformed to 
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Finally, let us consider the power expansion of Pochhammer's symbol: (x) m = 

are the Stirling numbers of the first kind. 
The expansion formula has been applied at x — 1 +a and m = n — 1, which 
implies 
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Glossary 



ACK acknowledgment packet. 13, 16-20, 22, 23, 30, 37, 41, 42, 56, 65 

ns-2 Network Simulator version 2. 26-27, 29, 38, 49, 56, 62, 68 

TCP Transmission Control Protocol. 2, 3, 7-9, 11, 12-18, 19-25, 28-30, 
37, 41, 42, 44, 45, 48, 56, 57, 65, 85 

UDP User Datagram Protocol. 7, 11, 12, 28 

AIMD additive increase, multiplicative decrease. 17, 80, 82, 83 
AS Autonomous System. 4, 5, 72, 76 

BA Barabasi-Albert model. 72, 73-76, 77-79, 83, 91, 93 

CA congestion avoidance. 15—16, 37, 41, 42 

CCDF complementary cumulative distribution function. 94, 95, 101-103 
CDF cumulative distribution function. 85 
cwnd congestion window. 13-17 

ER Erdos-Renyi model. 72, 73, 79, 91, 93, 97, 98 
ERD Early Random Drop. 9, 23, 25, 27 

FR/FR fast recovery, fast retransmission. 15, 16-17, 29, 36-38, 41, 42, 64, 
68, 80 

IID independent and identically-distributed. 10, 54 
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116 Glossary 

IP Internet Protocol. 5, 7, 8, 10, 11, 47 
ISP Internet Service Provider. 4, 5 

LAN Local Area Network. 9, 27, 29, 31, 35, 41, 42, 44, 45, 48, 49, 55, 59, 
62, 64, 68 

RED Random Early Detection. 9, 23-25, 27 
RTO retransmission timeout. 13, 18 
RTT round-trip time. 13, 14, 16-18, 24, 56 
rwnd receiver's advertised window. 14 

ssthresh slow start threshold. 15-17 

WAN Wide Area Network. 9, 10, 27, 31, 38, 40-42, 44, 45, 55-57, 62, 65, 
68 

WWW World Wide Web. 4, 5, 10, 11, 13 
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Summary 



The study of complex networks has evolved considerably in recent years. 
An interesting example of complex networks is the Internet, which has be- 
come part of everyday life. Two important aspects of the Internet, namely 
the properties of its topology and the characteristics of its data traffic, have 
attracted growing attention of the physics community. My thesis has consid- 
ered problems of both aspects. 

First I studied the stochastic behavior of TCP, the primary algorithm 
governing traffic in the current Internet, in an elementary network scenario 
consisting of a standalone infinite-sized buffer and an access link. I calcu- 
lated the stationary distribution of the stochastic congestion window process, 
which regulates the traffic of TCP. My analysis not only considered the ideal 
congestion window dynamics, but also included the effect of the fast recov- 
ery and fast retransmission (FR/FR) algorithms. Furthermore, I showed 
that my model can be extended further to involve the effect of link propa- 
gation delay, characteristic of WAN. An important achievement is that no 
parameter fitting is necessary in my model. I also applied the mean field 
approximation to describe many parallel TCP flows. After having been vali- 
dated against packet level numerical simulations, my analytic results agreed 
almost perfectly. 

I continued my thesis with the investigation of finite-sized semi-bottleneck 
buffers, where packets can be dropped not only at the link, but also at the 
buffer. I demonstrated that the behavior of the system depends only on a 
certain combination of the parameters. Moreover, an analytic formula was 
derived that gives the ratio of packet loss rate at the buffer to the total packet 
loss rate. This formula makes it possible to treat buffer-losses as if they were 
link-losses. I calculated the probability distribution of the congestion window 
for both LAN and WAN scenarios. I showed that a sharp peak might appear 
in the window distribution due to the FR/FR mode of TCP. My analytical 
results matched numerical simulations properly. 

In the last part of my thesis I studied computer networks from a structural 
perspective. I demonstrated through fluid simulations that the distribution 
of resources, specifically the link bandwidth, has a serious impact on the 
global performance of the network. Then I analyzed the distribution of edge 
betweenness in a growing scale-free tree under the condition that a local 
property, the in-degree of the "younger" node of an arbitrary edge, is known 
in order to find an optimum distribution of link capacity. The derived for- 
mula is exact even for finite-sized networks. I also calculated the conditional 
expectation of edge betweenness, rescaled for infinite networks. 



Osszefoglalas 



Az elmult evekben a komplex halozatok kutatasa rendkiviil sokat fe- 
jlodott. A komplex halozatok egyik legerdekesebb peldaja a mara a minden- 
napi elet reszeve valt internet. Az internet ket fontos teriilete - a topologiaja- 
nak tulajdonsagai es a rajta folyo adatforgalom jellemzoi - irant az utobbi 
idoben a fizikusok koreben is egyre nagyobb az erdeklodes. Dolgozatomban 
az emlitett ket teriilet nehany kerdeset vizsgaltam. 

Elsokent a jelenlegi internet legfontosabb forgalomszabalyozo algoritmusa- 
nak, a TCP-nek a sztochasztikus viselkedeset tanulmanyoztam egy elemi 
halozati konfiguracioban, mely egy egyediilallo bufferbol, es egy hozza kapc- 
solodo vezetekbol allt. Meghataroztam a TCP-forgalmat szabalyozo tor- 
lodasi ablaknak a stacionarius eloszlasat. Vizsgalatomban nem csupan az 
idealis torlodasi ablak dinamikat tekintettem, hanem figyelembe vettem a 
FR/FR (fast recovery/fast retransmission) algoritmusok hatasat is. Meg- 
mutattam tovabba, hogy a modellem hogyan altalanosithato ugy, hogy a 
vezetekeken fellepo csomagkesleltetest is figyelembe vegye. Fontos eredmeny, 
hogy nem sziikseges ismeretlen parametert illenszteni a modellben. A mod- 
ellt parhuzamosan mukodo TCP-k leirasara atlagter-kozelitesben alkalmaz- 
tam. Az analitikus eredmenyeket csomag-szintu numerikus szimulaciokkal 
osszevetve rendkiviil jo egyezest kaptam. 

A disszertaciot veges meretu bufferek vizsgalataval folytattam, ahol a 
csomagok nem csak a vezeteken, hanem a bufferben is elveszhetnek. Meg- 
mutattam, hogy a rendszer viselkedese csupan a parameterek egy bizonyos 
kombinaciojatol fiigg. Levezettem tovabba egy analitikus formulat, mely 
megadja a bufferben eldobott, es a vezeteken elveszett csomagok aranyat. 
Ez a formula lehetove teszi, hogy a bufferben torteno csomagveszteseket tigy 
tekintsiik, mintha az a vezeteken tortent volna. Kiszamoltam a torlodasi 
ablak eloszlasat mind lokalis (LAN), mind tag (WAN) halozati kornyezetben. 
Megmutattam, hogy az FR/FR algoritmusok miatt egy eles csucs jelenik meg 
az eloszlasban. Az analitikus eredmenyek jol egyeztek a szimulaciokkal. 

Dolgozatom utolso reszeben szerkezeti szempontbol vizsgaltam komplex 
szamitogepes halozatokat. Folyadekkozelitesu szimulaciokkal bemutattam, 
hogy az eroforrasok, kiilonoskeppen a vezetekek savszelessegenek elosztasa, 
jelentosen befolyasolja a halozat osszteljesitmenyet. Ezutan annak erdekeben, 
hogy az el-kapacitasok optimalis elrendezeset meghatarozzam, az el-koztesseg 
eloszlasat vizsgaltam novekvo skala-fiiggetlen faban azzal a feltetellel, hogy 
tetszoleges el „fiatalabb" csucsanak bejovo fokszama ismert. A levezetett for- 
mula meg veges meretu halozatokra is egzakt. Vegiil megadtam a vegtelen 
halozatokra atskalazott el-koztesseg felteteles varhato erteket. 



